OM protein - protein search, using sw model

Run on:          February 10, 2005, 17:39:57 ; Search time 476 Seconds
                 (without alignments)
                 912.812 Million cell updates/sec

Title:           US-09-892-287-1
Perfect score:   2020
Sequence:        1 MDVLSPLSFIKVSHVRMQGI...FQIPPGSLREDPLGEAQPQI 372

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        6959266 seqs, 1168006243 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
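
The throughput figure in the header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one dynamic-programming cell, i.e. one (query residue, database residue) pair; that definition is an assumption, not something the report states, and the function name is hypothetical.

```python
def cell_updates_per_second(query_length, db_residues, search_seconds):
    """Cells of the alignment matrix filled per second for one query.

    Assumption: every query residue is compared once against every
    database residue, so the cell count is query_length * db_residues.
    """
    return query_length * db_residues / search_seconds


# Values taken from the report header: 372-residue query,
# 1168006243 database residues, 476 seconds of search time.
rate = cell_updates_per_second(
    query_length=372,
    db_residues=1_168_006_243,
    search_seconds=476,
)
formatted = f"{rate / 1e6:.3f} Million cell updates/sec"
print(formatted)
```

Under that assumption the computed rate agrees with the 912.812 Million cell updates/sec printed in the header.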
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

RESULT 1
US-09-892-287-1 

; Sequence 1, Application US/09892287
GENERAL INFORMATION:
APPLICANT: Hillman, Jennifer L.
           Lai, Preeti
           Corley, Neil C.
           Shah, Purvi
TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL (4,5) BISPHOSPHATE 5-PHOSPHATASE
NUMBER OF SEQUENCES: 5
CORRESPONDENCE ADDRESS:
ADDRESSEE: Incyte Pharmaceuticals, Inc.
STREET: 3174 Porter Drive
CITY: Palo Alto
STATE: CA
COUNTRY: USA
ZIP: 94304
COMPUTER READABLE FORM:
MEDIUM TYPE: Diskette
COMPUTER: IBM Compatible
OPERATING SYSTEM: DOS
SOFTWARE: FastSEQ for Windows Version 2.0
CURRENT APPLICATION DATA:
APPLICATION NUMBER: US/09/892,287
FILING DATE: 26-Jun-2001
PRIOR APPLICATION DATA:
APPLICATION NUMBER: 09/258,643
FILING DATE: <Unknown>
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 1:
SEQUENCE CHARACTERISTICS:
LENGTH: 372 amino acids
TYPE: amino acid
STRANDEDNESS: single
TOPOLOGY: linear
IMMEDIATE SOURCE:
LIBRARY: BRSTNOT03
CLONE: 638789
SEQUENCE DESCRIPTION: SEQ ID NO: 1:
US-09-892-287-1 



Query Match 100.0%; Score 2020; DB 23; Length 372; 

Best Local Similarity 100.0%; Pred. No. 6.7e-198; 

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
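
Each hit carries the same three summary lines, written as semicolon-separated "name value" fields. A minimal sketch of reading them into a dictionary follows; the function name `parse_summary` is hypothetical and the field layout is inferred only from the lines shown in this report.

```python
def parse_summary(lines):
    """Turn 'Name value; Name value;' summary lines into {name: float}.

    Assumption: the value is always the last whitespace-separated token
    of a field, optionally followed by a percent sign.
    """
    fields = {}
    for line in lines:
        for part in line.split(";"):
            part = part.strip()
            if not part:
                continue
            # Split on the last space so multi-word names stay intact.
            name, _, value = part.rpartition(" ")
            fields[name.strip()] = float(value.rstrip("%"))
    return fields


summary_lines = [
    "Query Match 100.0%; Score 2020; DB 23; Length 372;",
    "Best Local Similarity 100.0%; Pred. No. 6.7e-198;",
    "Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;",
]
hit = parse_summary(summary_lines)
print(hit["Score"], hit["Pred. No."], hit["Matches"])
```

Splitting on the last space rather than the first keeps names such as "Best Local Similarity" and "Pred. No." in one piece.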
Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 2 

US-10-990-328-8797
Sequence 8797, Application US/10990328
GENERAL INFORMATION:
APPLICANT: CARGILL, Michele
TITLE OF INVENTION: POLYMORPHISMS IN NUCLEIC ACID MOLECULES
TITLE OF INVENTION: ENCODING HUMAN ENZYME PROTEINS, METHODS OF DETECTION AND
TITLE OF INVENTION: USES THEREOF
FILE REFERENCE: CL001495
CURRENT APPLICATION NUMBER: US/10/990,328
CURRENT FILING DATE: 2004-11-17
NUMBER OF SEQ ID NOS: 558824
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 8797
LENGTH: 372
TYPE: PRT
ORGANISM: Homo sapiens
US-10-990-328-8797

Query Match 100.0%; Score 2020; DB 35; Length 372;
Best Local Similarity 100.0%; Pred. No. 6.7e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 3 

PCT-US02-17526-14
Sequence 14, Application PC/TUS0217526
GENERAL INFORMATION:
APPLICANT: EXELIXIS, INC.
TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE
FILE REFERENCE: EX02-074C-PC
CURRENT APPLICATION NUMBER: PCT/US02/17526
CURRENT FILING DATE: 2002-06-03
PRIOR APPLICATION NUMBER: US 60/296,076
PRIOR FILING DATE: 2001-06-05
PRIOR APPLICATION NUMBER: US 60/328,605
PRIOR FILING DATE: 2001-10-10
PRIOR APPLICATION NUMBER: US 60/357,253
PRIOR FILING DATE: 2002-02-15
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn version 3.1
SEQ ID NO 14
LENGTH: 448
TYPE: PRT
ORGANISM: Homo sapiens
PCT-US02-17526-14

Query Match 100.0%; Score 2020; DB 1; Length 448;
Best Local Similarity 100.0%; Pred. No. 8.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 4 

US-10-161-510-14
Sequence 14, Application US/10161510
GENERAL INFORMATION:
APPLICANT: EXELIXIS, INC.
TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE
FILE REFERENCE: EX02-074C
CURRENT APPLICATION NUMBER: US/10/161,510
CURRENT FILING DATE: 2002-06-03
PRIOR APPLICATION NUMBER: US 60/296,076
PRIOR FILING DATE: 2001-06-05
PRIOR APPLICATION NUMBER: US 60/328,605
PRIOR FILING DATE: 2001-10-10
PRIOR APPLICATION NUMBER: US 60/357,253
PRIOR FILING DATE: 2002-02-15
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn version 3.1
SEQ ID NO 14
LENGTH: 448
TYPE: PRT
ORGANISM: Homo sapiens
US-10-161-510-14

Query Match 100.0%; Score 2020; DB 27; Length 448;
Best Local Similarity 100.0%; Pred. No. 8.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 5 

US-10-990-328-8795
Sequence 8795, Application US/10990328
GENERAL INFORMATION:
APPLICANT: CARGILL, Michele
TITLE OF INVENTION: POLYMORPHISMS IN NUCLEIC ACID MOLECULES
TITLE OF INVENTION: ENCODING HUMAN ENZYME PROTEINS, METHODS OF DETECTION AND
TITLE OF INVENTION: USES THEREOF
FILE REFERENCE: CL001495
CURRENT APPLICATION NUMBER: US/10/990,328
CURRENT FILING DATE: 2004-11-17
NUMBER OF SEQ ID NOS: 558824
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 8795
LENGTH: 448
TYPE: PRT
ORGANISM: Homo sapiens
US-10-990-328-8795

Query Match 100.0%; Score 2020; DB 35; Length 448;
Best Local Similarity 100.0%; Pred. No. 8.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 6 

US-10-990-328-8796
; Sequence 8796, Application US/10990328
; GENERAL INFORMATION:
; APPLICANT: CARGILL, Michele
; TITLE OF INVENTION: POLYMORPHISMS IN NUCLEIC ACID MOLECULES
; TITLE OF INVENTION: ENCODING HUMAN ENZYME PROTEINS, METHODS OF DETECTION AND
; TITLE OF INVENTION: USES THEREOF
; FILE REFERENCE: CL001495
; CURRENT APPLICATION NUMBER: US/10/990,328
; CURRENT FILING DATE: 2004-11-17
; NUMBER OF SEQ ID NOS: 558824
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 8796
; LENGTH: 448
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-990-328-8796

Query Match 100.0%; Score 2020; DB 35; Length 448;
Best Local Similarity 100.0%; Pred. No. 6.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 7 

US-10-170-205E-32368
Sequence 32368, Application US/10170205E
GENERAL INFORMATION:
APPLICANT: ADAMS, Mark
TITLE OF INVENTION: DEVICES, SUCH AS ARRAYS, COMPRISED OF HUMAN PROTEINS OR PROTEIN
TITLE OF INVENTION: CAPTURE AGENTS, AND USES THEREOF
FILE REFERENCE: CL001381
CURRENT APPLICATION NUMBER: US/10/170,205E
CURRENT FILING DATE: 2002-06-13
NUMBER OF SEQ ID NOS: 40312
SOFTWARE: PatentIn version 3.2
SEQ ID NO 32368
LENGTH: 449
TYPE: PRT
ORGANISM: Homo sapiens
US-10-170-205E-32368

Query Match 100.0%; Score 2020; DB 27; Length 449;
Best Local Similarity 100.0%; Pred. No. 8.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  78 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 137

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 138 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 197

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 198 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 257

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 258 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 317

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 318 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 377

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 378 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 437

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 438 REDPLGEAQPQI 449



RESULT 8 

US-10-990-328-8794
Sequence 8794, Application US/10990328
GENERAL INFORMATION:
APPLICANT: CARGILL, Michele
TITLE OF INVENTION: POLYMORPHISMS IN NUCLEIC ACID MOLECULES
TITLE OF INVENTION: ENCODING HUMAN ENZYME PROTEINS, METHODS OF DETECTION AND
TITLE OF INVENTION: USES THEREOF
FILE REFERENCE: CL001495
CURRENT APPLICATION NUMBER: US/10/990,328
CURRENT FILING DATE: 2004-11-17
NUMBER OF SEQ ID NOS: 558824
SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 8794
LENGTH: 449
TYPE: PRT
ORGANISM: Homo sapiens
US-10-990-328-8794

Query Match 100.0%; Score 2020; DB 35; Length 449;
Best Local Similarity 100.0%; Pred. No. 8.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  78 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 137

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 138 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 197

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 198 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 257

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 258 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 317

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 318 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 377

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 378 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 437

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 438 REDPLGEAQPQI 449



RESULT 9 

US-09-724-676-79765
Sequence 79765, Application US/09724676
GENERAL INFORMATION:
APPLICANT: Compugen LTD
TITLE OF INVENTION: Variants of alternative splicing
FILE REFERENCE: 129181.4 Compugen
CURRENT APPLICATION NUMBER: US/09/724,676
CURRENT FILING DATE: 2000-11-28
NUMBER OF SEQ ID NOS: 97222
SOFTWARE: PatentIn version 3.2
SEQ ID NO 79765
LENGTH: 480
TYPE: PRT
ORGANISM: Homo sapiens
US-09-724-676-79765

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480



RESULT 10 

US-09-724-676-79776
Sequence 79776, Application US/09724676
GENERAL INFORMATION:
APPLICANT: Compugen LTD
TITLE OF INVENTION: Variants of alternative splicing
FILE REFERENCE: 129181.4 Compugen
CURRENT APPLICATION NUMBER: US/09/724,676
CURRENT FILING DATE: 2000-11-28
NUMBER OF SEQ ID NOS: 97222
SOFTWARE: PatentIn version 3.2
SEQ ID NO 79776
LENGTH: 480
TYPE: PRT
ORGANISM: Homo sapiens
US-09-724-676-79776

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480



RESULT 11 

US-09-724-676-79782
; Sequence 79782, Application US/09724676
; GENERAL INFORMATION:
APPLICANT: Compugen LTD
TITLE OF INVENTION: Variants of alternative splicing
FILE REFERENCE: 129181.4 Compugen
CURRENT APPLICATION NUMBER: US/09/724,676
CURRENT FILING DATE: 2000-11-28
NUMBER OF SEQ ID NOS: 97222
SOFTWARE: PatentIn version 3.2
SEQ ID NO 79782
LENGTH: 480
TYPE: PRT
ORGANISM: Homo sapiens
US-09-724-676-79782

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480



RESULT 12 

US-09-724-676A-79765
Sequence 79765, Application US/09724676A
GENERAL INFORMATION:
APPLICANT: Compugen LTD
TITLE OF INVENTION: Variants of alternative splicing
FILE REFERENCE: 129181.4 Compugen
CURRENT APPLICATION NUMBER: US/09/724,676A
CURRENT FILING DATE: 2000-11-28
NUMBER OF SEQ ID NOS: 97222
SOFTWARE: PatentIn version 3.2
SEQ ID NO 79765
LENGTH: 480
TYPE: PRT
ORGANISM: Homo sapiens
US-09-724-676A-79765

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480



RESULT 13 

US-09-724-676A-79776
; Sequence 79776, Application US/09724676A
; GENERAL INFORMATION:
; APPLICANT: Compugen LTD
; TITLE OF INVENTION: Variants of alternative splicing
; FILE REFERENCE: 129181.4 Compugen
; CURRENT APPLICATION NUMBER: US/09/724,676A
; CURRENT FILING DATE: 2000-11-28
; NUMBER OF SEQ ID NOS: 97222
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 79776
; LENGTH: 480
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-724-676A-79776

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480


RESULT 14 

US-09-724-676A-79782
Sequence 79782, Application US/09724676A
GENERAL INFORMATION:
APPLICANT: Compugen LTD
TITLE OF INVENTION: Variants of alternative splicing
FILE REFERENCE: 129181.4 Compugen
CURRENT APPLICATION NUMBER: US/09/724,676A
CURRENT FILING DATE: 2000-11-28
NUMBER OF SEQ ID NOS: 97222
SOFTWARE: PatentIn version 3.2
SEQ ID NO 79782
LENGTH: 480
TYPE: PRT
ORGANISM: Homo sapiens
US-09-724-676A-79782

Query Match 100.0%; Score 2020; DB 21; Length 480;
Best Local Similarity 100.0%; Pred. No. 9.9e-198;
Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 168

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 228

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 288

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 348

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 408

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 468

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 469 REDPLGEAQPQI 480



RESULT 15
PCT-US02-17526-11
; Sequence 11, Application PC/TUS0217526
; GENERAL INFORMATION:
; APPLICANT: EXELIXIS, INC.
; TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE
; FILE REFERENCE: EX02-074C-PC
; CURRENT APPLICATION NUMBER: PCT/US02/17526
; CURRENT FILING DATE: 2002-06-03
; PRIOR APPLICATION NUMBER: US 60/296,076
; PRIOR FILING DATE: 2001-06-05
; PRIOR APPLICATION NUMBER: US 60/328,605
; PRIOR FILING DATE: 2001-10-10
; PRIOR APPLICATION NUMBER: US 60/357,253
; PRIOR FILING DATE: 2002-02-15
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 11
; LENGTH: 478
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: (56)..(56)
; OTHER INFORMATION: "X" is any amino acid
PCT-US02-17526-11

Query Match 99.6%; Score 2012; DB 1; Length 478;
Best Local Similarity 99.7%; Pred. No. 6.5e-197;
Matches 371; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy          1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        107 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 166

Qy         61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        167 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 226

Qy        121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        227 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 286

Qy        181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        287 DTSEKKRKPAWTDRILWRLKRQACAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 346

Qy        241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        347 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 406

Qy        301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        407 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 466

Qy        361 REDPLGEAQPQI 372
              ||||||||||||
Db        467 REDPLGEAQPQI 478



Search completed: February 10, 2005, 18:01:44
Job time : 477 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2005 Compugen Ltd.



OM protein - protein search, using sw model

Run on:         February 10, 2005, 17:47:17 ; Search time 23 Seconds
                (without alignments)
                592.366 Million cell updates/sec

Title:          US-09-892-287-1
Perfect score:  2020
Sequence:       1 MDVLSPLSFIKVSHVRMQGI..........FQIPPGSLREDPLGEAQPQI 372

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       144007 seqs, 36624764 residues

Total number of hits satisfying chosen parameters: 144007

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
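
The throughput figure printed in the header is consistent with counting one dynamic-programming cell per (query residue, database residue) pair and dividing by the search time. The sketch below checks that reading against the values printed above. The function name is illustrative, and the formula is an inference from the numbers rather than something the GenCore output states.

```python
def cell_updates_per_second(query_length: int, db_residues: int, seconds: float) -> float:
    """Assumed definition: one cell update per (query residue, database residue) pair."""
    return query_length * db_residues / seconds


# Values taken from the search header: 372-residue query,
# 36,624,764 database residues, 23 seconds of search time.
rate = cell_updates_per_second(372, 36_624_764, 23)
print(f"{rate / 1e6:.3f} Million cell updates/sec")  # prints 592.366 Million cell updates/sec
```

Under this reading, the rate depends only on the query length, the database size in residues, and the wall-clock search time.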



Database :      Pending_Patents_AA_New:*
                1:  /cgn2_6/ptodata/2/paa/PCT_NEW_COMB.pep:*
                2:  /cgn2_6/ptodata/2/paa/US06_NEW_COMB.pep:*
                3:  /cgn2_6/ptodata/2/paa/US07_NEW_COMB.pep:*
                4:  /cgn2_6/ptodata/2/paa/US08_NEW_COMB.pep:*
                5:  /cgn2_6/ptodata/2/paa/US09_NEW_COMB.pep:*
                6:  /cgn2_6/ptodata/2/paa/US10_NEW_COMB.pep:*
                7:  /cgn2_6/ptodata/2/paa/US11_NEW_COMB.pep:*
                8:  /cgn2_6/ptodata/2/paa/US60_NEW_COMB.pep:*

8 : /cgn2_6/ptodata/2/paa/US60_NEW_COMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
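
The search header names a Smith-Waterman ("sw") model with a gap-opening penalty of 10.0 and a gap-extension penalty of 0.5. A minimal sketch of local alignment scoring with affine gaps (Gotoh's recurrences) is given here for orientation. It rests on two assumptions that the report does not confirm: the first gapped position costs Gapop and each further position costs Gapext, and a toy +5/-4 scoring function (`toy_score`, hypothetical) stands in for BLOSUM62. Scores from this sketch are therefore not comparable to the scores listed in this report.

```python
from typing import Callable


def toy_score(x: str, y: str) -> float:
    """Hypothetical stand-in for BLOSUM62: +5 for identity, -4 otherwise."""
    return 5.0 if x == y else -4.0


def local_score_affine(a: str, b: str,
                       score: Callable[[str, str], float] = toy_score,
                       gap_open: float = 10.0,
                       gap_extend: float = 0.5) -> float:
    """Best local alignment score using Gotoh's affine-gap recurrences.

    Assumed gap convention: a gap of length n costs
    gap_open + (n - 1) * gap_extend.
    """
    neg = float("-inf")
    n, m = len(a), len(b)
    h = [[0.0] * (m + 1) for _ in range(n + 1)]  # best score ending at (i, j)
    e = [[neg] * (m + 1) for _ in range(n + 1)]  # alignment ends with a gap in a
    f = [[neg] * (m + 1) for _ in range(n + 1)]  # alignment ends with a gap in b
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            e[i][j] = max(h[i][j - 1] - gap_open, e[i][j - 1] - gap_extend)
            f[i][j] = max(h[i - 1][j] - gap_open, f[i - 1][j] - gap_extend)
            diag = h[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            h[i][j] = max(0.0, diag, e[i][j], f[i][j])
            best = max(best, h[i][j])
    return best


# Twelve identical residues score 12 * 5 under the toy scheme.
print(local_score_affine("REDPLGEAQPQI", "REDPLGEAQPQI"))  # prints 60.0
```

A production search would replace `toy_score` with a lookup into a real substitution matrix and would keep only two rows of each table in memory.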

SUMMARIES 

                    %
Result           Query
   No.   Score   Match  Length  DB  ID                   Description
     1     328    16.2     908   1  PCT-US04-42360-1594  Sequence 1594, Ap
     2   104.5     5.2     458   6  US-10-450-763-57306  Sequence 57306, A
     3    90.5     4.5     497   6  US-10-450-763-33366  Sequence 33366, A
     4    89.5     4.4     873   6  US-10-496-011-6      Sequence 6, Appli
     5      89     4.4    1048   6  US-10-450-763-52226  Sequence 52226, A
     6      88     4.4     551   6  US-10-450-763-50598  Sequence 50598, A
     7    86.5     4.3     815   6  US-10-496-011-67     Sequence 67, Appl
     8    86.5     4.3     851   6  US-10-496-011-3      Sequence 3, Appli
     9      85     4.2     321   8  US-60-643-717-11816  Sequence 11816, A
    10      85     4.2     512   6  US-10-450-763-34667  Sequence 34667, A
    11      85     4.2     735   7  US-11-021-951-123    Sequence 123, App
    12      85     4.2    1388   6  US-10-450-763-51703  Sequence 51703, A
    13    84.5     4.2     716   6  US-10-450-763-43858  Sequence 43858, A
    14    84.5     4.2     731   6  US-10-450-763-48706  Sequence 48706, A
    15      84     4.2     380   1  PCT-US04-17965-2224  Sequence 2224, Ap
    16      83     4.1     565   6  US-10-450-763-39254  Sequence 39254, A
    17      83     4.1     738   6  US-10-450-763-48671  Sequence 48671, A
    18      83     4.1     746   6  US-10-450-763-41065  Sequence 41065, A
    19      83     4.1     746   6  US-10-450-763-52958  Sequence 52958, A
    20      83     4.1     851   6  US-10-450-763-56207  Sequence 56207, A
    21      83     4.1    1404   6  US-10-450-763-34552  Sequence 34552, A
    22    82.5     4.1     515   6  US-10-450-763-31949  Sequence 31949, A
    23    82.5     4.1    1114   1  PCT-US04-42360-604   Sequence 604, App
    24      82     4.1     376   6  US-10-450-763-39711  Sequence 39711, A
    25      82     4.1     940   6  US-10-450-763-32557  Sequence 32557, A
    26    81.5     4.0     243   7  US-11-031-175-15153  Sequence 15153, A
    27      81     4.0     500   6  US-10-450-763-58159  Sequence 58159, A
    28      81     4.0     501   6  US-10-450-763-59146  Sequence 59146, A
    29      81     4.0    2114   6  US-10-450-763-39435  Sequence 39435, A
    30      81     4.0    2356   6  US-10-450-763-39431  Sequence 39431, A
    31    80.5     4.0     535   8  US-60-643-717-1969   Sequence 1969, Ap
    32    80.5     4.0     625   7  US-11-031-175-12183  Sequence 12183, A
    33    80.5     4.0     631   6  US-10-450-763-52920  Sequence 52920, A
    34    80.5     4.0     796   7  US-11-033-545-386    Sequence 386, App
    35      80     4.0     633   6  US-10-450-763-57583  Sequence 57583, A
    36      80     4.0    1065   6  US-10-450-763-56125  Sequence 56125, A
    37      80     4.0    1065   6  US-10-450-763-58750  Sequence 58750, A
    38    79.5     3.9     264   7  US-11-027-399-3617   Sequence 3617, Ap
    39    79.5     3.9     264   7  US-11-027-843-3617   Sequence 3617, Ap
    40    79.5     3.9     264   7  US-11-027-878-3617   Sequence 3617, Ap
    41    79.5     3.9     264   7  US-11-028-169-3617   Sequence 3617, Ap
    42    79.5     3.9     264   7  US-11-028-204-3617   Sequence 3617, Ap
    43    79.5     3.9     264   7  US-11-027-877-3617   Sequence 3617, Ap
    44    79.5     3.9     264   7  US-11-027-879-3617   Sequence 3617, Ap
    45    79.5     3.9     264   7  US-11-028-149-3617   Sequence 3617, Ap
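
The "% Query Match" column in the summaries agrees with the ratio of each hit's score to the perfect score of 2020 given in the search header. The sketch below reproduces a few of the listed values under that assumption; the function name is illustrative, and the definition is inferred from the numbers, not documented in the report.

```python
PERFECT_SCORE = 2020  # "Perfect score" from the search header


def query_match_percent(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    """Assumed definition: hit score as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


# (score, % Query Match) pairs from summary rows 1, 2, 3 and 38.
for score, listed in [(328, 16.2), (104.5, 5.2), (90.5, 4.5), (79.5, 3.9)]:
    computed = round(query_match_percent(score), 1)
    print(f"score {score}: computed {computed}%, listed {listed}%")
```

The same check is a quick way to spot digits that were misread during text extraction, since a score and its percentage must agree.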



ALIGNMENTS 



RESULT 1
PCT-US04-42360-1594
; Sequence 1594, Application PC/TUS0442360
; GENERAL INFORMATION:
; APPLICANT: The General Hospital Corporation doing business as Massachusetts General
; APPLICANT: Hospital / Bayer AG
; TITLE OF INVENTION: Nucleotide sequences involved in pain
; FILE REFERENCE: 17633/2048
; CURRENT APPLICATION NUMBER: PCT/US04/42360
; CURRENT FILING DATE: 2004-12-17
; PRIOR APPLICATION NUMBER: 60/531,341
; PRIOR FILING DATE: 19-Dec-2003
; NUMBER OF SEQ ID NOS: 2587
; SOFTWARE: Perl script
; SEQ ID NO 1594
; LENGTH: 908
; TYPE: PRT
; ORGANISM: Rattus norvegicus
; PUBLICATION INFORMATION:
; DATABASE ACCESSION NUMBER: Refseq / NP_445928
; DATABASE ENTRY DATE: 2003-10-05
PCT-US04-42360-1594

Query Match 16.2%; Score 328; DB 1; Length 908;
Best Local Similarity 33.6%; Pred. No. 1.5e-21;
Matches 84; Conservative 44; Mismatches 106; Indels 16; Gaps 6;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

:: :: :: |: | || : || |:|: :: : ||: | |||| | | : : : 
Db 22 5 YVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLC 2 84 

Qy 69 I INCHLPPHI SNNYQRLEHFDRIL - EMQNCEGRD I PNI LDHDL I I WFGDMNFRIEDFGLH 127 

: I I :| M I =: II = II = I II 1 = 11 I 

Db 285 FVCSHFAAGQSQVKERNEDFVEIARKLSFPMGR MLFSHDYVFWCGDFNYRI -DLPNE 340 

Qy 12 8 FVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKR 187 

Ml—: | ||| | : | | | | : : | | | |j : | | |||||| | 

Db 341 EVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCR 400 

Qy 188 KPAWTDRILWRLKRQPC - - AGPDTPI PPASHFSLSLRGYS SHMTYG ISDHK 236 

lllllhlll := I = I = II I h : : II llh 

Db 401 TPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDE SKI LYTWTPGTLLHYGRAELKTSDHR 460 



Qy        237 PVSGTFDLEL 246

Db        461 PVVALIDIDI 470
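
The "Best Local Similarity" percentages in this report agree with the number of identical positions divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). The sketch below checks that reading against the statistics lines of several results. The definition is inferred from the printed values and is an assumption, and the function name is illustrative.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Assumed definition: identities as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# (Matches, Conservative, Mismatches, Indels, listed %) from RESULTS 1, 2, 9 and 11.
rows = [
    (84, 44, 106, 16, 33.6),
    (71, 51, 133, 85, 20.9),
    (35, 23, 67, 22, 23.8),
    (68, 53, 136, 102, 18.9),
]
for m, c, x, g, listed in rows:
    print(round(best_local_similarity(m, c, x, g), 1), "listed:", listed)
```

Note that "Gaps" counts gap runs while "Indels" counts gapped columns, which is why only the latter enters the denominator here.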



RESULT 2
US-10-450-763-57306
; Sequence 57306, Application US/10450763
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
; FILE REFERENCE: 790CIP3/US
; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11
; PRIOR APPLICATION NUMBER: PCT/US01/08631
; PRIOR FILING DATE: 2001-03-30
; PRIOR APPLICATION NUMBER: 09/540,217
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: 09/649,167
; PRIOR FILING DATE: 2000-08-23
; NUMBER OF SEQ ID NOS: 60736
; SOFTWARE: Custom
; SEQ ID NO 57306
; LENGTH: 458
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (235)..(278)
; OTHER INFORMATION: 1 kw MIND INHIBITOR DIVISION CONTROL domain identified by
; OTHER INFORMATION: eMATRIX, accession number DM01756B, p-value=4.822e-15, raw score of
; OTHER INFORMATION: 22.91
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (200)..(305)
; OTHER INFORMATION: ParA family ATPase domain identified by PFam, accession name
; OTHER INFORMATION: ParA, E-value=4.9e-24, PFam score of 93.3
US-10-450-763-57306

Query Match 5.2%; Score 104.5; DB 6; Length 458;
Best Local Similarity 20.9%; Pred. No. 0.12;
Matches 71; Conservative 51; Mismatches 133; Indels 85; Gaps 17;
Qy 46 LFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNI 105 



Db 



I :|||| 
8 V I GVAAHKGGV 





149 



QY 



106 - - LDHDLI IWFGDMNFRI EDFGLHF VRESI KNRCYGGL - WEKDQLS I AKKHDP 

: I I | = = II I I I HI h II | = : : 

150 ASMYHG WVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCLALHRIETE 



155 



Db 



206 



Qy 



156 LLREFQEGRLLFPP TYKFDRNSNDYDTSEKKRKP AWTDR I LWRLKRQPCAGPD 



208 



Db 





259 



Db 



Qy 



260 



209 



TPIP-PASHFSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVEN 2 67 

= 1 Ml 1 = 1 = = ||= | = | |= | = 

LI VPTPAELFD YTSALQF FDMLRDLLKNVDLKGFEPDKKWLAGH 3 03 



Qy 



268 



DMMVSYSSTSDFPSSPWDWIGL-YKVGLRDVNDYVSYAWV GDSKVSCSDNLNQVYI 322 



Db 



304 




Qy 



323 



DI SNI PTTEDEFLLCYYSNSLRS WGI SRPFQI PPGSLRE 3 62 



Db 



361 



ILGTVTL DAVGIGLVMPVLPGLLRD 3 8 




RESULT 3
US-10-450-763-33366
; Sequence 33366, Application US/10450763
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
; FILE REFERENCE: 790CIP3/US
; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11
; PRIOR APPLICATION NUMBER: PCT/US01/08631
; PRIOR FILING DATE: 2001-03-30
; PRIOR APPLICATION NUMBER: 09/540,217
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: 09/649,167
; PRIOR FILING DATE: 2000-08-23
; NUMBER OF SEQ ID NOS: 60736
; SOFTWARE: Custom
; SEQ ID NO 33366
; LENGTH: 497
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (235)..(278)
; OTHER INFORMATION: 1 kw MIND INHIBITOR DIVISION CONTROL domain identified by
; OTHER INFORMATION: eMATRIX, accession number DM01756B, p-value=4.822e-15, raw score of
; OTHER INFORMATION: 22.91
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (200)..(307)
; OTHER INFORMATION: ParA family ATPase domain identified by PFam, accession name
; OTHER INFORMATION: ParA, E-value=6.5e-23, PFam score of 89.5
US-10-450-763-33366

Query Match 4.5%; Score 90.5; DB 6; Length 497;
Best Local Similarity 20.5%; Pred. No. 2.4;
Matches 84; Conservative 59; Mismatches 138; Indels 129; Gaps 23;

Qy 4 6 LFGYWGNKGGVNI CLKL YGYYVS I INCHLPPHI SNNYQRLEHFDR I LEMQNCEGRD I PNI 105 

: I HIM | : :: || :: : | = || | 

Db 108 VI GVAAHKGG V YKTSVSVHLAQDLAL KGLRVLLVEGNDPQGT 149 



Qy 106 - -LDHDLIIWFGDMNFRIEDFGLHF VRESIKNRCYGGL-WEKDQLSIAKKHDP 155 

: I I I - • II II I Hi h II l = = = 

Db 150 ASMYHG WVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCLALHRIETE 206 

Qy 156 LLREFQEGRLLFPP TYKFDRNSNDYDTSEKKRKP AWTDRILWRLKRQPCAGPD 208 

h :| Ihl I = -III | | :: || 
Db 2 07 LMGKFDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPNLGIGTINW CAADV 259 

Qy 209 TPIP-PASHF SLSLRGYS SHMTYG I SDHKPVSGTFDLELKPLV 250 

= | | | | :: |:|: : -hill 

Db 2 60 LIVPTPAELFDYTSALQFFDMLRDLLKNVDLKGFEPDTSVSTVTLPSLTLTLNL-LPPSY 318 

Qy 251 SAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSP 2 83 

I = I lh = : = : =| I I : I I 

Db 319 KDPCDYIGP- -TWTIQDNLPISRSLT- -LTTSPKFFLPSERLSSSWLDVRHIEKYVDQGK 374 

Qy 284 WDW IG LYKVGLRD VND YVS Y AWVGD S KVS C S DNLNQ V - 320 

II II | : = | I = :|| =| I I = = =1 

Db 3 75 SGTRELLWSWAQKNKTIGDLLQVLQEMGHRRAIHLITNYVRHA-VTKSATATATGYQEVL 4 33 

Qy 321 YIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGSLRED-PL 365 

= |:= I = I : Mill: I = hh II 

Db 434 NSEQILDLTKAP EVFAIPIQGNRLRGVVLVVGRFGV- -SQLKEEHPL 478 



RESULT 4
US-10-496-011-6
; Sequence 6, Application US/10496011
; GENERAL INFORMATION:
; APPLICANT: BOURGERON, THOMAS
; APPLICANT: JAMAIN, STEPHANE
; APPLICANT: QUACH, HELENE
; APPLICANT: BETANCUR, CATALINA
; APPLICANT: LEBOYER, MARION
; APPLICANT: GILLBERG, CHRISTOPHER
; TITLE OF INVENTION: POLYNUCLEOTIDE AND PROTEIN INVOLVED IN SYNAPTOGENESIS, VARIANTS
; TITLE OF INVENTION: THEREOF, AND THEIR THERAPEUTIC AND DIAGNOSTIC USES
; FILE REFERENCE: 253820US0XPCT
; CURRENT APPLICATION NUMBER: US/10/496,011
; CURRENT FILING DATE: 2004-05-28
; PRIOR APPLICATION NUMBER: PCT/FR02/04134
; PRIOR FILING DATE: 2002-11-28
; PRIOR APPLICATION NUMBER: CA2364106
; PRIOR FILING DATE: 2001-11-30
; NUMBER OF SEQ ID NOS: 98
; SOFTWARE: PatentIn version 3.3
; SEQ ID NO 6
; LENGTH: 873
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-496-011-6

Query Match 4.4%; Score 89.5; DB 6; Length 873;
Best Local Similarity 18.8%; Pred. No. 6.4;
Matches 85; Conservative 59; Mismatches 134; Indels 173; Gaps 22;



Qy 5 SPLSFIKVSH VRMQGILLLVFA-KYQHLPYIQILSTKSTPTGLFGYWGNKG 54 

I =1 = =11 : I M II I :|h =1 
Db 315 SCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILA DKV 361 

Qy 55 GVNI CLKLYGYYVS I INCHLP PHI SNNYQRLEH 87 

I h III I I I II 

Db 362 GCNMLDTTDMVECLKNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLN 421 

Qy 88 FDR I LEMQKTCEG - RD I PNI LDHDL 1 1 WFGDMNFR I EDF GLHFVRESIKNRC 137 

= | =| : || : : |:|:: : | =| = :| | :||:|| 

Db 422 YDIMLGWQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIK-FM 4 80 

Qy 138 YGGLWEKDQLS I AKKHDPLLREFQEGRLLFP PTYKFDRNSNDYDTSE 184 

| :|: :| |: | : : : | ||| : | : 

Db 481 YTDWADKENPETRRK- - TLVALFTDHQWVAPAVATADLHAQYGSPTYFY AFYHHCQ 534 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

: Ihl I II - I I 
Db 535 SEMKPSWAD SAHGDEVPYVFGI 556 

Qy 245 ELKPLVSAPL I VLMPEDLWTV ENDMMVS YSSTSDFPSSPWDWIGLYKV 292 

|:: | =|:: :||:|:| :: | | |: | 
Db 557 PMIG PTELFSCNFSKNDVMLSAWMTYWTNFAKTGD -PNQPVP 598 

Qy 293 GLRDV ND YVS Y AWVGD S KVS C S DNLNQ V Y I D I S N I P TTED E F L 335 

:| I = II Jl : I 1 = 1 = I I I = I 
Db 599 - - QDTKFIHTKPNRFEEVAW SKYNPKD QLYLHIGLKPRVRDHYRATKVAFWLEL 650 

Qy 336 LCYYSNSLRSWGISRPFQIPPGSLREDPLG 3 66 

: : | =| ::|| : | | 

Db 651 VPHLHNLNEIFQYVSTTTKVPPPDMTSFPYG 681 



RESULT 5
US-10-450-763-52226
; Sequence 52226, Application US/10450763
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
; FILE REFERENCE: 790CIP3/US
; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11
; PRIOR APPLICATION NUMBER: PCT/US01/08631
; PRIOR FILING DATE: 2001-03-30
; PRIOR APPLICATION NUMBER: 09/540,217
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: 09/649,167
; PRIOR FILING DATE: 2000-08-23
; NUMBER OF SEQ ID NOS: 60736
; SOFTWARE: Custom
; SEQ ID NO 52226
; LENGTH: 1048
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (410)..(439)
; OTHER INFORMATION: kw REPLICATION REPB REPA INITIATION domain identified by
; OTHER INFORMATION: eMATRIX, accession number DM00653A, p-value=7.207e-27, raw score of
; OTHER INFORMATION: 20.40
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (651)..(765)
; OTHER INFORMATION: ParA family ATPase domain identified by PFam, accession name
; OTHER INFORMATION: ParA, E-value=2.7e-28, PFam score of 107.5
US-10-450-763-52226

Query Match 4.4%; Score 89; DB 6; Length 1048;
Best Local Similarity 21.4%; Pred. No. 9;
Matches 75; Conservative 47; Mismatches 112; Indels 116; Gaps 19;

Qy 46 LFG YWGNKGGVN I CLKL YG Y YVS I INCHLP PH I SNNYQRLEHFDR I LEMQNCEGRD I PN I 105 

: ! :|||| | : :: || :: = | = || | 

Db 553 V I G VAAHKGG V YKTSVSVHLAQDLAL KGLRVLLVEGNDPQGT 599 • 

Qy 106 - -LDHDLIIWFGDMNFRIEDFGLHF VRES I KNRCYGGL - WEKDQLS I AKKHDP 155 

: I I I- II I I I : I I 1= II I- = 

Db 600 ASMYHG WVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDI IPSCLALHRIETE 656 

Qy 156 LLREFQEGRLLFPP - TYKFDRNSNDYDTSEKKRKP AWTDRILWRLKRQPCAGPD 208 

h :| Ihl I : = = 111 | | :: || 
Db 657 LMGKFDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPNLGIGTINW CAADV 709 

Qy 209 TPIP-PASHFSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVEN 267 

= 1 II I hi = = 1 1 : I : I I II I 

Db 710 LIVPTPAELFD YTSALQF FDM-LRDL LKNVDLKGFEP 745 

Qy 268 D- -MMVSYSSTSDFPSSPW DWIGLYKVG 293 

I = = = = I 1= III I = 1 II 

Db 746 DVRILLTKYSNSNGSQSPWMEEQIRDAWGSMVLKNWRETDEVG- -KVAEYNGYHFVFAL 803 

Qy 2 94 LRD VND YVS YAWVGD S KVS C S DNLNQ VY I D ISNIPTTEDE 3 33 

1 = 1=1 llh = I =1= I = = l I =1 = 

Db 804 AGSPKDAVDTSIYMFYQKVGDNSIDSWKNAGRVFKDSDKFVANDPILKDQ 853 



RESULT 6
US-10-450-763-50598
; Sequence 50598, Application US/10450763
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
; FILE REFERENCE: 790CIP3/US
; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11
; PRIOR APPLICATION NUMBER: PCT/US01/08631
; PRIOR FILING DATE: 2001-03-30
; PRIOR APPLICATION NUMBER: 09/540,217
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: 09/649,167
; PRIOR FILING DATE: 2000-08-23
; NUMBER OF SEQ ID NOS: 60736
; SOFTWARE: Custom
; SEQ ID NO 50598
; LENGTH: 551
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: (1)...(551)
; OTHER INFORMATION: Xaa = X or * as defined in Table 2
US-10-450-763-50598

Query Match 4.4%; Score 88; DB 6; Length 551;
Best Local Similarity 22.8%; Pred. No. 4.7;
Matches 64; Conservative 39; Mismatches 84; Indels 94; Gaps 19;

Qy 4 LS PLS F I KVSHVRMQGI L LLVFAKY QHL PYIQILSTKSTPTGLFG 48 

Ml lllh I 11 = 1 = 11 = 11 = I 
Db 188 LSP-TYPAAHHFR-SGIIGSGLCVFSKHPIQELTQHIYTLNGYPYMI HHG 235 

Qy 4 9 YWGNKGGVN-ICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILD 107 

1=1=111== 11111= II : - : 

Db 23 6 DWFSGKAVGLLVLHLNGMELNAYVTHL - -HAEYNRQK DIYLTHLVGQAWELAQFIH 289 

Qy 108 H DLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLL- -REF 160 

I l = = = 11 = 1 II I = = | =11 III hi 

Db 2 90 HTS KKAD WLLCGDLNMHPEDLGCCLLKE WTGL HDAYLETRDF 332 

Qy 161 QEGRLLFPPTYKFDRNSNDYDTSEKKRKP AWTDRILWRLKRQPCAGPDTPIPP 213 

= 11=1 II l = = = II I =l = = 
Db 333 KGSEEGNTMVP KNCY-VSQQELKPFPFGVRIDYVLYK A 369 

Qy 214 ASHFSLSLR GYS SHMT YGI SDHKPVSGTFDLELKP 24 8 

11=1= 1=1 =111= =1=1 

Db 370 VSGFYISCKSFETTTGFDPHSGTPLSDHEALMATLFVRHSP 410 



RESULT 7
US-10-496-011-67
; Sequence 67, Application US/10496011
; GENERAL INFORMATION:
; APPLICANT: BOURGERON, THOMAS
; APPLICANT: JAMAIN, STEPHANE
; APPLICANT: QUACH, HELENE
; APPLICANT: BETANCUR, CATALINA
; APPLICANT: LEBOYER, MARION
; APPLICANT: GILLBERG, CHRISTOPHER
; TITLE OF INVENTION: POLYNUCLEOTIDE AND PROTEIN INVOLVED IN SYNAPTOGENESIS, VARIANTS
; TITLE OF INVENTION: THEREOF, AND THEIR THERAPEUTIC AND DIAGNOSTIC USES
; FILE REFERENCE: 253820US0XPCT
; CURRENT APPLICATION NUMBER: US/10/496,011
; CURRENT FILING DATE: 2004-05-28
; PRIOR APPLICATION NUMBER: PCT/FR02/04134
; PRIOR FILING DATE: 2002-11-28
; PRIOR APPLICATION NUMBER: CA2364106
; PRIOR FILING DATE: 2001-11-30
; NUMBER OF SEQ ID NOS: 98
; SOFTWARE: PatentIn version 3.3
; SEQ ID NO 67
; LENGTH: 815
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-496-011-67

Query Match 4.3%; Score 86.5; DB 6; Length 815;
Best Local Similarity 18.6%; Pred. No. 11;
Matches 84; Conservative 60; Mismatches 134; Indels 173; Gaps 22;

Qy 5 SPLSFIKVSH VRMQG I LLL VFA - KYQHLP Y I Q I LSTKS TPTGLFG YWGNKG 54 

I =| : :|| = I I =1 II I =11= : I 

Db 257 SCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILA DKV 303 

Qy 55 GVNI CLKLYGYYVSI INCHLP PHISNNYQRLEH 87 

11= 11= I I I II 

Db 3 04 GCNMLDTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLN 3 63 

Qy 88 FDRILEMQNCEG-RDIPNILDHDLIIWFGDMNFRIEDF GLHFVRES I KNRC 137 

= 1 =1 = II = = | = | = = = | :| = =| | =|| = ll 

Db 3 64- YDIMLGVNQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIK-FM 422 

Qy 13 8 YGGLWEKDQLS IAKKHDPLLREFQEGRLLFP PTYKFDRNSNDYDTSE 184 

1=1= = I I = I = = = I III = I = 

Db 423 YTDWADKENPETRRK- - TLVALFTDHQWVAPAVATADLHAQYGSPTYFY AFYHHCQ 476 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

= 11 = 1 I I I =11 
Db 4 77 SEMKPSWAD S AHGDEVP YVFG I 498 

Qy 24 5 ELKPLVS APL I VLMPEDLWTV ENDMMVS YSSTSDFPSSPWDWIGLYKV 2 92 

l = = I =l = = =ll = hl == I I h I 

Db 499 PMIG PTELFSCNFSKNDVMLSAWMTYWTNFAKTGD-PNQPVP 540 

Qy 293 GLRDV NDYVS YAWVGDSKVSCSDNLNQVYIDI SNI PTTEDEF L 335 

= 1 1=11 II- = I 1 = 1= 111= I 
Db 541 - -QDTKFIHTKPNRFEEVAW SKYNPKD QLYLHIGLKPRVRDHYRATKVAFWLEL 592 

Qy 3 36 LCYYSNSLRSWGISRPFQIPPGSLREDPLG 366 

= = I =1 = = || : I I 

Db 5 93 VPHLHNLNE I FQYVSTTTKVPPPDMTS FP YG 62 3 



RESULT 8
US-10-496-011-3
; Sequence 3, Application US/10496011
; GENERAL INFORMATION:
; APPLICANT: BOURGERON, THOMAS
; APPLICANT: JAMAIN, STEPHANE
; APPLICANT: QUACH, HELENE
; APPLICANT: BETANCUR, CATALINA
; APPLICANT: LEBOYER, MARION
; APPLICANT: GILLBERG, CHRISTOPHER
; TITLE OF INVENTION: POLYNUCLEOTIDE AND PROTEIN INVOLVED IN SYNAPTOGENESIS, VARIANTS
; TITLE OF INVENTION: THEREOF, AND THEIR THERAPEUTIC AND DIAGNOSTIC USES
; FILE REFERENCE: 253820US0XPCT
; CURRENT APPLICATION NUMBER: US/10/496,011
; CURRENT FILING DATE: 2004-05-28
; PRIOR APPLICATION NUMBER: PCT/FR02/04134
; PRIOR FILING DATE: 2002-11-28
; PRIOR APPLICATION NUMBER: CA2364106
; PRIOR FILING DATE: 2001-11-30
; NUMBER OF SEQ ID NOS: 98
; SOFTWARE: PatentIn version 3.3
; SEQ ID NO 3
; LENGTH: 851
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-496-011-3

Query Match 4.3%; Score 86.5; DB 6; Length 851;
Best Local Similarity 18.6%; Pred. No. 11;
Matches 84; Conservative 60; Mismatches 134; Indels 173; Gaps 22;

Qy 5 SPLSFIKVSH VRMQGILLLVFA-KYQHLPYIQILSTKSTPTGLFGYWGNKG 54 

I :| : :|| I I : l II I =lh =1 
Db 2 93 SCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILA DKV 3 39 

Qy 55 GVNI CLKLYGYYVS I INCHLP PHISNNYQRLEH 87 

I 1= 11= I I I II 

Db 34 0 GCNMLDTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLN 3 99 

Qy. 88 FDR I LEMQNCEG - RD I PN I LDHDL 1 1 WFGDMNFRI ED F GLHFVRES I KNRC 137 

= | =| : || : : |:|:: : | =| : =| | :||:|| 

Db 400 YDIMLGVNQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIK-FM 4 58 

Qy 13 8 YGGLWEKDQLS I AKKHDPLLREFQEGRLLFP PTYKFDRNSNDYDTSE 184 

| :|: : | | : | : : : | ||| : | = 

Db 459 YTDWADKENPETRRK- -TLVALFTDHQWVAPAVATADLHAQYGSPTYFY AFYHHCQ 512 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

: IM I I I HI 
Db 513 SEMKPSWAD SAHGDEVPYVFGI 534 

Qy 245 ELKPLVSAPLIVLMPEDLWTV ENDMMVS YSSTSDFPSSPWDWIGLYKV 292 

l = = I = l = = =11=1 = 1 -111=1 
Db 535 PMIG PTELFSCNFSKNDVMLSAWMTYWTNFAKTGD-PNQPVP 576 

Qy 293 GLRDV NDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEF L 335 

= 1 1=1111=1 hh 111= I 
Db 577 - -QDTKFIHTKPNRFEEVAW SKYNPKD QL YLH I GLKPRVRDH YRATKVAFWLEL 628 

Qy 33 6 LCYYSNSLRSWGISRPFQIPPGSLREDPLG 3 66 

: : | :| ::|| : | | 

Db 62 9 VPHLHNLNE I FQYVSTTTKVPPPDMTSFPYG 659 



RESULT 9
US-60-643-717-11816
; Sequence 11816, Application US/60643717
; GENERAL INFORMATION:
; APPLICANT: Abad, Mark S.
; TITLE OF INVENTION: Genes and Uses for Plant Improvement
; FILE REFERENCE: 38-21(53629)A
; CURRENT APPLICATION NUMBER: US/60/643,717
; CURRENT FILING DATE: 2005-01-12
; NUMBER OF SEQ ID NOS: 19247
; SEQ ID NO 11816
; LENGTH: 321
; TYPE: PRT
; ORGANISM: Candidatus Blochmannia floridanus
US-60-643-717-11816

Query Match 4.2%; Score 85; DB 8; Length 321;
Best Local Similarity 23.8%; Pred. No. 4.3;
Matches 35; Conservative 23; Mismatches 67; Indels 22; Gaps 5;

Qy 51 GNKGGVN ICLKLYGYYVS I INCHLPPHI SNNYQRLEHFDRILEMQNCEGRD I PNI L 106 

hi I I :| | 11= = =1 I I = = I :| = l M I 

Db 177 GHKAAVNHVNLLCKKAGAKYVTTLP I SVPSHCS I MKKMVSRFQKI I ENTVISSPKIPVIN 236 

Qy 107 DHDLI I WFGDMNFRIEDFGLHFVRES IKNRCYGGL - WEKDQLS IAKKHDPLLREFQEG - - 163 

: |: | : |:|:|: = | = | = | =: | | 

Db 23 7 NVDVSI EQEPQFIRDSLIRQLYNPVRWNEIMQEFINKDIKIVLEMGPGKV 2 86 

Qy 164 - -RLLFPPTYK FDRNSNDYDTSEK 185 

11= I I = II h I 

Db 2 87 LTRLIQRSVYSDSLFSLS INDVDSLSK 313 



RESULT 10
US-10-450-763-34667
; Sequence 34667, Application US/10450763
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
; FILE REFERENCE: 790CIP3/US
; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11
; PRIOR APPLICATION NUMBER: PCT/US01/08631
; PRIOR FILING DATE: 2001-03-30
; PRIOR APPLICATION NUMBER: 09/540,217
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: 09/649,167
; PRIOR FILING DATE: 2000-08-23
; NUMBER OF SEQ ID NOS: 60736
; SOFTWARE: Custom
; SEQ ID NO 34667
; LENGTH: 512
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (116)..(143)
; OTHER INFORMATION: Mrp family proteins domain identified by eMATRIX, accession
; OTHER INFORMATION: number BL01215A, p-value=4.964e-11, raw score of 9.75
; FEATURE:
; NAME/KEY: DOMAIN
; LOCATION: (211)..(302)
; OTHER INFORMATION: ParA family ATPase domain identified by PFam, accession name
; OTHER INFORMATION: ParA, E-value=0.011, PFam score of -1.9
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: (1)...(512)
; OTHER INFORMATION: Xaa = X or * as defined in Table 2
US-10-450-763-34667

Query Match 4.2%; Score 85; DB 6; Length 512;
Best Local Similarity 21.6%; Pred. No. 8;
Matches 68; Conservative 43; Mismatches 124; Indels 80; Gaps 16;

Qy 46 LFGYVJGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNI 105 

: I HIM | : :: || :: : | : || | 

Db 119 VIGVAAHKGGV YKTSVSVHLAQDLAL KGLRVLLVEGNDPQGT 160 

Qy 106 - - LDHDLI IWFGDMNFRIEDFGLHF VRES IKNRCYGGL - WEKDQLS I AKKHDP 155 

: I I I ' • II II I 'II h II | = : : 

Db 161 ASMYHG WVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCLALHRIETE 217 

Qy 156 LLREFQEGRLLFPP TYKFDRNSNDYDTSEKKRKP AWTDRILWRLKRQPCAGPD 208 

h :| Ihl I : -IN I I : I II 
Db 218 LMGKFDEGKLPTDPHLMLRLGIETVAHDYDVLVIDSPPNLGIGTINVRWAADVQP 2 72 

Qy 209 TP I PPASHFS LS LRG YS SHMTYGI SDHKP VSGTFDLE - LKPLVSAPL I VLMPEDLWTVEN 267 

| : : | : | | : | : | | : : | | : : | 

Db 273 LXIPKQQQNEKYQVPQ FDQSTIKNIESAKGLDVW- -DSWPLQN 313 

Qy 268 -DMMVS YS STSDFPS S PWDW IGL YKVGLRDVND WS YAWVGDS KVSCSDNLNQVT I 322 

Ihl III I : h I Ml: = I =h 

Db 314 ADGT VAE YNG YH WF AL AG S P KD ADDTS I YMFYQKVGDNS IDSWKNAGRVFK 365 

Qy 323 DI SNIPTTEDE 333 

I =1 I =h 

Db 3 66 DSDKFDANDPILKDQ 3 80 



RESULT 11
US-11-021-951-123
; Sequence 123, Application US/11021951
; GENERAL INFORMATION:
; APPLICANT: HAUPTS, Ulrich
; APPLICANT: KOLTERMANN, Andre
; APPLICANT: SCHEIDIG, Andreas
; APPLICANT: VOTSMEIER, Christian
; APPLICANT: Kettling, Ulrich
; APPLICANT: COCO, Wayne Michael
; TITLE OF INVENTION: New Biological Entities And The Pharmaceutical
; TITLE OF INVENTION: And Diagnostic Use Thereof
; FILE REFERENCE: 04156.0002U5
; CURRENT APPLICATION NUMBER: US/11/021,951
; CURRENT FILING DATE: 2004-12-22
; PRIOR APPLICATION NUMBER: 10/872,198
; PRIOR FILING DATE: 2004-06-18
; PRIOR APPLICATION NUMBER: 60/543,518
; PRIOR FILING DATE: 2004-02-11
; PRIOR APPLICATION NUMBER: 60/524,960
; PRIOR FILING DATE: 2003-11-25
; PRIOR APPLICATION NUMBER: EP 04003058
; PRIOR FILING DATE: 2004-02-11
; PRIOR APPLICATION NUMBER: EP 03025871
; PRIOR FILING DATE: 2003-11-11
; PRIOR APPLICATION NUMBER: EP 03025851
; PRIOR FILING DATE: 2003-11-10
; PRIOR APPLICATION NUMBER: EP 03013819
; PRIOR FILING DATE: 2003-06-18
; NUMBER OF SEQ ID NOS: 191
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 123
; LENGTH: 735
; TYPE: PRT
; ORGANISM: Homo sapiens
US-11-021-951-123

Query Match 4.2%; Score 85; DB 7; Length 735;
Best Local Similarity 18.9%; Pred. No. 13;
Matches 68; Conservative 53; Mismatches 136; Indels 102; Gaps 14;

Qy 62 LYGYYV5IINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDH-DLIIWFGDMNFR 12 0 

I III I =1 I =::= I * * I =11= = 111= = 

Db 19 LLGYYFSDLNFQAPMWTSS TTGDLSIPSSELENIPSENQYFQSAIWSGFIKVK 72 

Qy 121 IEDFGLHFVRES I KNRCYGGLWEKDQLS I AKKHDPLLREFQEGRLLFPPTYKFDRNSND - 17 9 

| : | | : :| || | | = = = || |= = = I = 

Db 73 KSD- -EYTFATSADN- -HVTMWVDDQEVINKASNSNKIRLEKGR-LYQIKIQYQRENPTE 127 

0y 18 C YDT S E KKR KP AWTDR I L W RLKRQPCAGPDTP IPPAS 215 

= h h= =1 = III III I II 

Db 120 KGLDFKLYVJTDSQNKKEVISSDNLQLPELKQKSSNSRKKRSTSAGPTVPDRDNDGIPD-- 185 

Qy 216 HFSLSLRGY SSHMTYGI SDHKPVSGTF 242 

II = II =11= II = 1 = 1 

Db 186 - -SLEVEGYTVDVKNKRTFLSPWISNIHEKKGLTKYKSSPEKWSTASDPYSDFEKVTGRI 243 

Qy 24 3 DLELKPLVSAPLIVLMP EDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRD 2 96 

1=1 11= I h= =1= = == I I 
Db 244 DKNVS PEARHPLVAAYP I VHVDMENI I LS KNEDQSTQNTDSQ TRT 288 

Qy 297 VNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355 

= = I = hi = == = II = =111 I I I 

Db 289 ISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAG FSNSNSSTVAIDHSLSL 340 



RESULT 12 

US -10 -450 -7 63 -517 03 

Sequence 51703, Application US/10450763 
GENERAL INFORMATION: 
APPLICANT: Hyseq, Inc 

TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 
FILE REFERENCE: 790CIP3/US 

CURRENT APPLICATION NUMBER: US/ 10/450 , 763 



CURRENT FILING DATE: 2003-06-11 
PRIOR APPLICATION NUMBER: PCT/US01/ 0863 1 
PRIOR FILING DATE: 2001-03-30 
PRIOR APPLICATION NUMBER: 09/540,217 
PRIOR FILING DATE: 2000-03-31 
PRIOR APPLICATION NUMBER: 09/649,167
PRIOR FILING DATE: 2000-08-23 
NUMBER OF SEQ ID NOS: 60736
SOFTWARE: Custom
SEQ ID NO 51703
LENGTH: 1388
TYPE: PRT

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: DOMAIN
LOCATION: (1293)..(1318)
OTHER INFORMATION: Ubiquitin carboxyl-terminal hydrolases family 2 proteins
OTHER INFORMATION: domain identified by eMATRIX, accession number BL00972D, p-value=
OTHER INFORMATION: 7.750e-19, raw score of 22.55
FEATURE:
NAME/KEY: DOMAIN
LOCATION: (1290)..(1350)
OTHER INFORMATION: Ubiquitin carboxyl-terminal hydrolase family domain
OTHER INFORMATION: identified by PFam, accession name UCH-2, E-value=1.1e-26, PFam
OTHER INFORMATION: score of 102.0
US-10-450-763-51703

Query Match 4.2%; Score 85; DB 6; Length 1388;

Best Local Similarity 20.5%; Pred. No. 30; 

Matches 83; Conservative 59; Mismatches 160; Indels 102; Gaps 22; 

Qy 32 PYIQILSTKSTPTGLFG- -YWGN- -KGGVNICLKLY-GYYVSIINCHLPPHISNNYQRLE 86 

lh = : = I I I = =1 = 1= I I = I III : 

Db 665 PYVELKDSDGRPDWEVAAEAWDNHLRRNRS 1 1 VDLFHGQLRSQVKCKTCGHI SVRFDPFN 724 

Qy 87 HFDRILEMQNCEGRDIPNI-LDHDLIIWFG-DMNFRIEDFGLHFVRESIKNRCYGGLWEK 144

I I : :| | || : :| :| : || :: ::: | || 

Db 725 FLSLPLPMDSYMDLEITVIKLDGTTPVRYGLRLNMDEKYTGL KKQLRDLC- -GL-NS 778 

Qy 145 DQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPC 204

:h :|: II =::| II 1= II = I 
Db 779 EQILLAEVHDSNIKDFSSS PST NGMFTLTTNGDL PKP I F I PN 820 

Qy 205 AGPDTPIPPASHFSLSLRGYSSHMT YG I SDHKPVSGTFDLELKPLVS APL I 255

| = | :| = = = : II | |: |: : | | | = | : 

Db 821 GMPNTWPCGTEKNFTNGMVNGHMPSLPDSPFTGYIIAVHRKMMRTELYFLSPQENRPSL 880 

Qy 256 VLMPEDLWTVENDMMVS YSSTSDFPSSPW DWIG 288

II ::| :: | : | |: 

Db 881 FGMP L I VPCTVHTQKKDLYDAVW IQVS WLARPLPPQEAS IHAQDRDNCMGY 931 

Qy 289 L YKVGLRDV - ND YVS YAWVGD SKVSCSDN LNQVYIDI SNI PTTEDEFLLCY 338

I II I hi II |: | :: : || : || | | 

Db 932 QYPFTLRWQKDGISCAWCPQYRFCRGCKIDCGEDRAFIGNAYIAVDWHPTA LHLRY 988 



Qy 339 YSNSLRSV VGISRPFQIPPGSL REDPLGEAQ 369 

:= I I I II I hi h 111 = = 

Db 989 QTSQERWDKHESVEQSRRAQAEPINLDSCLRAFTSEEELGESE 1032 



RESULT 13 

US-10-450-763-43858

Sequence 43858, Application US/10450763 
GENERAL INFORMATION: 
APPLICANT: Hyseq, Inc 

TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 
FILE REFERENCE: 790CIP3/US

CURRENT APPLICATION NUMBER: US/10/450,763
CURRENT FILING DATE: 2003-06-11
PRIOR APPLICATION NUMBER: PCT/US01/08631
PRIOR FILING DATE: 2001-03-30 
PRIOR APPLICATION NUMBER: 09/540,217 
PRIOR FILING DATE: 2000-03-31 
PRIOR APPLICATION NUMBER: 09/649,167 
PRIOR FILING DATE: 2000-08-23 
NUMBER OF SEQ ID NOS : 60736 
SOFTWARE: Custom 
SEQ ID NO 43858 
LENGTH: 716 
TYPE : PRT 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: DOMAIN
LOCATION: (235)..(278)
OTHER INFORMATION: 1 kw MIND INHIBITOR DIVISION CONTROL domain identified by
OTHER INFORMATION: eMATRIX, accession number DM01756B, p-value=4.822e-15, raw score of
OTHER INFORMATION: 22.91
US-10-450-763-43858

Query Match 4.2%; Score 84.5; DB 6; Length 716; 

Best Local Similarity 22.7%; Pred. No. 14; 

Matches 61; Conservative 30; Mismatches 89; Indels 89; Gaps 16; 
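
The report does not define "Best Local Similarity". The figures printed for each result are, however, consistent with the number of identical positions divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). A minimal Python sketch under that assumption; the function name is hypothetical and is not part of GenCore:

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent identity over all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns

# Counts copied from the statistics lines of two results in this report.
print(f"{best_local_similarity(61, 30, 89, 89):.1f}")    # reported as 22.7%
print(f"{best_local_similarity(65, 37, 96, 106):.1f}")   # reported as 21.4%
```
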

Qy 138 YGGLW EKDQLS I AKKHDPLLREFQEGRL - LF PPTYKF DRNSNDYDT 182

II I II I I I =hl II hi = 1 = 

Db 427 YGESWNLLRADQRLIFAKSWPRASRYQQGHQDLFILRSDLPSQVFIRDKLMERRNRRTGR 486 

Qy 183 SEKKRKPAWTDRIL- -WRLKRQPCAGPD TPIPPASHFSLSLRGYSSHMTY 230 

= 111 ||l=l= II h I = hill 
Db 487 TE KAR I WE VTDRTVRTW I GE AVAAAAADGVTFS VPVTPHTF RHSYAMHMLYAGI PL 542

Qy 231 GISDHKPVSGT FDLELKPLVSAPLIV LMPE 260 

= II =1 I I h hll = =h 

Db 543 KVLQSLMGHKSISSTEVYTKVFALD VAAPAR I PKQQQNE KYQ VPQFDQS T I KN I E S 598

Qy 261 DLWT VEN - DMMVS YS STSDFPS S PWDW IGL YKVGLRDVND YVS YAWVGD 308

I I = = M h I III 1 = 1=1 III

Db 599 AKGLD VWD S W PLQNADGTVAE YNG YHWF AL AG S PKD ADDTS I YMFYQKVGD 650

Qy 309 SKVSCSDNLNQVYIDI SNIPTTEDE 333

Db 651 NSIDSWKNAGRVFKDSDKFDANDPILKDQ 679



RESULT 14 

US-10-450-763-48706 

; Sequence 48706, Application US/10450763 
; GENERAL INFORMATION: 
; APPLICANT: Hyseq, Inc 

; TITLE OF INVENTION: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 
; FILE REFERENCE: 790CIP3/US

; CURRENT APPLICATION NUMBER: US/10/450,763
; CURRENT FILING DATE: 2003-06-11

PRIOR APPLICATION NUMBER: PCT/US01/08631

PRIOR FILING DATE: 2001-03-30 

PRIOR APPLICATION NUMBER: 09/540,217 

PRIOR FILING DATE: 2000-03-31 

PRIOR APPLICATION NUMBER: 09/649,167 
; PRIOR FILING DATE: 2000-08-23 
; NUMBER OF SEQ ID NOS : 60736 
; SOFTWARE: Custom 
; SEQ ID NO 48706 
LENGTH: 731 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: DOMAIN
LOCATION: (620)..(635)
OTHER INFORMATION: GLYCOSYL HYDROLASE FAMILY 1 SIGNATURE domain identified by
OTHER INFORMATION: eMATRIX, accession number PR00131A, p-value=1.563e-11, raw score of
OTHER INFORMATION: 12.98
FEATURE:
NAME/KEY: DOMAIN
LOCATION: (531)..(719)
OTHER INFORMATION: Glycosyl hydrolase family 1 domain identified by PFam,
OTHER INFORMATION: accession name Glyco_hydro_1, E-value=1.3e-43, PFam score of 155.2

US-10-450-763-48706



Query Match 4.2%; Score 84.5; DB 6; Length 731; 

Best Local Similarity 18.5%; Pred. No. 14; 

Matches 74; Conservative 48; Mismatches 119; Indels 159; Gaps 19;

Qy 54 GGVNI CLK - LYG YYVS I INC - HLP PHISNNYQRLEHF 88 

I :| II I Ull U hi II = I 

Db 343 GNINNALKYAKGEFVS I FDCDHVPTRSFLQMTMGWFLKEKQLAMMQTPH HFFSPDPF 399 



Qy 89 DRILEMQNCEGRDIPNILDHDLIIWFG DMNFRIEDFGLHFVRES IKNRCYGGLWE 143 

= 11 III: -'I : I I M hi I 

Db 400 ERNLG RFRKTPN EGTLFYGLRQSWEFNAR GLELVKEGRAQACVSAGNT 447 



Qy 144 KDQLSIAKKHDPLLR- -EFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKR 201 

: :|| ||: | | |:| | := : = 

Db 448 GALMGLAKL LLKPLEGQNHRMLL RWEAKLAALVRL 482

Qy 202 QPCAGPDTPIPPAS HFSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSA 252

Db 483 QENTAQPVPFAPNNARPLTLEDDRLSCTVRGYNFAITFSKMSGKPTSW QLVAS 535



Qy 253 PLIVLM PEDLWTV ENDMMVSYSSTSDFPS - 281 

I : Mhl II = : =1 = 

Db 536 ALATKIAHEVNPQNQVGCMLAGGNFYPYSCKPEDVWAALEKDRENLFFIDVQARGTYPAY 595

Qy 282 S P WD W I GL YKVGLRD VND YVS YAWVGD S KVS CSDNLNQ VY I D I SN I 327 

-■II l = = 1 = 11 = :: I II 
Db 596 S AR VFREKGVT I NKAPGD DE I LKNTVDFVS FS YYASRCAS AEMNANN 642

Qy 328 PTTEDEFLLCYYSNSLRS WGISRPF - QI PPGSLREDPLG 366 

l== I = I = I = Mil 

Db 643 SSAANWKSLRNPYLQVSDWGWGIDPLG 670 



RESULT 15 

PCT-US04-17965-2224

; Sequence 2224, Application PC/TUS0417965 
; GENERAL INFORMATION: 



; APPLICANT: ARBORGEN, LLC
; APPLICANT: BLOKSBERG, LEONARD N.
; APPLICANT: BRYANT, CATHERINE
; APPLICANT: CONNETT, MARIE B.
; APPLICANT: EMERSON, SARAH JANE
; APPLICANT: FROST, MICHAEL J.
; APPLICANT: FORSTER, RICHARD LLEWELLYN SYDNEY
; APPLICANT: GRIGOR, MURRAY
; APPLICANT: HIGGINS, COLLEEN
; APPLICANT: LASHAM, ANNETTE
; APPLICANT: LUND, STEVEN TROY
; APPLICANT: MAGUSIN, ANDREAS
; APPLICANT: PHILLIPS, JONATHAN
; APPLICANT: PUTHIGAE, SATHIAH
; APPLICANT: VEERAKONE, STELLA
; APPLICANT: WESTWOOD, CLAIRE
; APPLICANT: GAUSE, KATRINA
; APPLICANT: WOOD, MARION
; APPLICANT: ROTTMAN, WILLIAM
; APPLICANT: HAVUKKALA, ILKKA



; TITLE OF INVENTION: TRANSCRIPTION FACTORS 
; FILE REFERENCE: 044463-0296 

; CURRENT APPLICATION NUMBER: PCT/US04/17965

; CURRENT FILING DATE: 2004-06-07 

; PRIOR APPLICATION NUMBER: 60/476,189 

; PRIOR FILING DATE: 2003-06-06 

; NUMBER OF SEQ ID NOS: 3679

SOFTWARE: PatentIn Ver. 3.2
; SEQ ID NO 2224

LENGTH: 380

TYPE: PRT

ORGANISM: Pinus radiata
PCT-US04-17965-2224

Query Match 4.2%; Score 84; DB 1; Length 380; 

Best Local Similarity 21.4%; Pred. No. 6.7;

Matches 65; Conservative 37; Mismatches 96; Indels 106; Gaps 16;



Qy 92 LEMQNCEGRDIPNILDHDLIIWFGDMNFRIEDFGLHFVRESIKNR-CYGGL 141 

I t = I II hh h : : UN h II 

Db 100 LE VQKRR I YD I TNVLE G IGL I EKRLKNRI CWKGLS VSRPGEVE 142 

Qy 142 WEKDQLS I AK- KHDPLLREFQEG -RLLFPPTYKFDRNSNDYDTSEKKRKPAW - 191 

I I h: : I I :h II hi II I I 

Db 143 DEATVLQAEVDS LNLEECKLDDC I RDMQERLRI L SEDDRNRRWL 186 

Qy 192 --TDRILWRL-KRQPCAGPDT PIP PASHFSLSLRGYSSHMT 229 

II 'h II II 2 II I = : II I 
Db 187 YVTDE D I KKLPCFQNDTL I AI KAPHGTTLEVPDPDEAVEYPQRRYQ I LLRS T 238 

Qy 230 YGISDHKPVSGTFDLELKPLVSAPLIV-LMPEDLWTVENDMMVSYSSTSDFP 280 

| | || |= == = | |:| : I : | : | 

Db 239 MGPIDVYLVS-QFEENIEEMNPVDLATELVPSGICPAEGVTISSVQEGATFVEMECQGHE 297

Qy 281 SSPWDWI-GLYKVGLRDVNDYVSYAWVGDSKVSCSD NLNQVYIDISNI 327 

•'II I h :: II I : II I =| | = :: = 

Db 298 TRQPCTELTSPQDSAGGIMRIVPSDANIDADYWLLSDSGVGITDMWRTETNSTWDEVIEL 357 

Qy 328 PTTE 331 

h 

Db 358 NPTD 361 



Search completed: February 10, 2005, 13:02:13 
Job time : 25 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2005 Compugen Ltd.



OM protein - protein search, using sw model

Run on:          February 10, 2005, 17:03:41 ; Search time 166 Seconds
                 (without alignments)
                 866.716 Million cell updates/sec

Title:           US-09-892-287-1
Perfect score:   2020
Sequence:        1 MDVLSPLSFIKVSHVRMQGI FQIPPGSLREDPLGEAQPQI 372

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        2105692 seqs, 386760381 residues

Total number of hits satisfying chosen parameters: 2105692

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%

Maximum Match 100%
Listing first 45 summaries 
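
The report does not say how the "Million cell updates/sec" figure is obtained. The header values are consistent with the usual Smith-Waterman accounting, in which one dynamic-programming cell is filled per (query residue, database residue) pair. A minimal Python sketch under that assumption; the function name is hypothetical:

```python
def cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Cells filled per second, assuming query_len * db_residues cells in total."""
    return query_len * db_residues / seconds

# Values from the header above: 372-residue query, 386760381 residues, 166 s.
rate = cell_updates_per_sec(372, 386_760_381, 166)
print(f"{rate / 1e6:.3f} Million cell updates/sec")
```

With the header values this reproduces the 866.716 figure printed in the report.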

Database : A_Geneseq_16Dec04:*

1: geneseqp1980s:*

2: geneseqp1990s:*

3: geneseqp2000s:*

4: geneseqp2001s:*

5: geneseqp2002s:*

6: geneseqp2003as:*

7: geneseqp2003bs:*

8: geneseqp2004s:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
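
The "% Query Match" column in the summaries is not defined in the report, but its values agree with the hit score expressed as a percentage of the perfect score (2020 for this query). A short Python sketch under that assumption; the names are illustrative only:

```python
PERFECT_SCORE = 2020  # "Perfect score" from the search header

def query_match_percent(score: float, perfect: float = PERFECT_SCORE) -> float:
    """Hit score as a percentage of the query's self-alignment score (assumed)."""
    return 100.0 * score / perfect

# Scores taken from the summary list of this search.
for score in (2020, 2012, 2010, 1025, 872.5):
    print(score, f"{query_match_percent(score):.1f}")
```
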



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description
    1    2020   100.0     372   2  AAW97094   Aaw97094 Phosphati
    2    2020   100.0     448   6  AAE36070   Aae36070 Human pho
    3    2012    99.6     478   6  AAE36067   Aae36067 Human pho
    4    2010    99.5     372   6  ABO07198   Abo07198 Human p53
    5    2010    99.5     372   6  AAE36069   Aae36069 Human pho
    6    2010    99.5     448   6  AAE36068   Aae36068 Human pho
    7    2010    99.5     448   6  ABO53040   Abo53040 Human put
    8    1025    50.7     240   8  ADN61442   Adn61442 Human KPP
    9    872.5   43.2    1001   5  AAU98903   Aau98903 Rat inosi
   10    870.5   43.1     638   8  ADR08504   Adr08504 Human pro
   11    868.5   43.0     639   7  ADB65114   Adb65114 Human pro
   12    868.5   43.0     639   7  ADM04740   Adm04740 Human pro
   13    868.5   43.0    1006   5  AAU98904   Aau98904 Human ino
   14    868.5   43.0    1006   6  ABU54583   Abu54583 Human NOV
   15    831.5   41.2     751   5  AAU98902   Aau98902 Human ino
   16    831.5   41.2    1056   6  AAE36066   Aae36066 Human pho
   17    683.5   33.8     279   4  AAG73981   Aag73981 Human col
   18    637.5   31.6     254   6  ABR01000   Abr01000 Human gen
   19    637.5   31.6     254   6  ADA98192   Ada98192 Human sec
   20    637.5   31.6     255   3  AAB27797   Aab27797 Human sec
   21    541.5   26.8     162   5  ABP43025   Abp43025 Human ova
   22    507.5   25.1     508   4  ABB64662   Abb64662 Drosophil
   23    500.5   24.8     397   7  ADJ68405   Adj68405 Human hea
   24    494.5   24.5     381   3  AAB27845   Aab27845 Sequence
   25    460.5   22.8     357   4  ABB62704   Abb62704 Drosophil
   26    460.5   22.8     357   5  AAU76369   Aau76369 Drosophil
   27    432.5   21.4     749   4  AAB95181   Aab95181 Human pro
   28    428     21.2     382   3  AAB27846   Aab27846 Protein f
   29    423.5   21.0     776   8  ADT49828   Adt49828 Murine IN
   30    367.5   18.2     968   7  ADJ69678   Adj69678 Human hea
   31    328     16.2    1244   7  ADJ70439   Adj70439 Human hea
   32    328     16.2    1269   6  ABU08111   Abu08111 Human kin
   33    328     16.2    1574   7  ADE55582   Ade55582 Rat Prote
   34    328     16.2    1575   7  ADE55584   Ade55584 Human Pro
   35    327.5   16.2    1443   7  ADD46317   Add46317 Human Pro
   36    327.5   16.2    1443   7  ADE63139   Ade63139 Human Pro
   37    327.5   16.2    1443   7  ADJ69292   Adj69292 Human hea
   38    327.5   16.2    1443   7  ADN95796   Adn95796 Human BEC
   39    327.5   16.2    1496   8  ADR14671   Adr14671 Human NF-
   40    327.5   16.2    1496   8  ABM80825   Abm80825 Tumour-as
   41    327.5   16.2    1510   5  ABB76319   Abb76319 Human pro
   42    326     16.1    1248   7  ADD46315   Add46315 Rat Prote
   43    326     16.1    1248   7  ADE63137   Ade63137 Rat Prote
   44    316.5   15.7    1114   5  AAU76370   Aau76370 Drosophil
   45    316.5   15.7    1218   4  ABB62412   Abb62412 Drosophil

ALIGNMENTS 



RESULT 1 
AAW97094 

ID AAW97094 standard; protein; 372 AA. 
XX 

AC AAW97094; 
XX 

DT 28-APR-1999 (first entry) 
XX 

DE Phosphatidylinositol 4,5-bisphosphate 5-phosphatase.
XX

KW Human; phosphatidylinositol 4,5-bisphosphate 5-phosphatase; PBPP;
KW Incyte clone 638789; antagonist; immune disorder; cancer;
KW neuronal disorder; human tubby homologue.
XX

OS Homo sapiens.
XX 



FH Key Location/Qualifiers

FT Modified-site 38
FT /note= "potential phosphorylation site"
FT Modified-site 132
FT /note= "potential phosphorylation site"
FT Modified-site 170
FT /note= "potential phosphorylation site"
FT Modified-site 183
FT /note= "potential phosphorylation site"
FT Modified-site 192
FT /note= "potential phosphorylation site"
FT Modified-site 275
FT /note= "potential phosphorylation site"
FT Modified-site 282
FT /note= "potential phosphorylation site"
FT Modified-site 295
FT /note= "potential phosphorylation site"
FT Modified-site 312
FT /note= "potential phosphorylation site"
FT Modified-site 329
FT /note= "potential phosphorylation site"
FT Modified-site 330
FT /note= "potential phosphorylation site"
FT Modified-site 359
FT /note= "potential phosphorylation site"



XX 

PN WO9900507-A1.
XX

PD 07-JAN-1999.
XX

PF 26-JUN-1998; 98WO-US013399.
XX

PR 27-JUN-1997; 97US-00884681.
XX 

PA (INCY-) INCYTE PHARM INC. 
XX 

PI Hillman JL, Lai P, Corley NC, Shah P; 
XX 

DR WPI; 1999-095752/08. 

DR N-PSDB; AAX15254.
XX

PT Phosphatidylinositol 4,5-bisphosphate 5-phosphatase - used for treating

PT immune disorders, cancers, and neuronal disorders. 

XX 

PS Claim 1; Fig 1A-G; 81pp; English. 
XX 

CC The present sequence encodes a human phosphatidylinositol 4,5- 

CC bisphosphate 5-phosphatase (PBPP) protein. The PBPP polynucleotide was 

CC first identified in Incyte clone 638789 from the breast cancer cDNA 

CC library BRSTNOT03. Antagonists of the PBPP protein can be used in the

CC treatment or prevention of an immune disorder, a cancer, or a neuronal 

CC disorder. The PBPP polynucleotide can be used for the detection of 

CC polynucleotides encoding human tubby homologue. The immune disorders that 

CC can be treated include AIDS, Addison's disease, adult respiratory 

CC distress syndrome, allergies, anaemia, asthma, atherosclerosis, Crohn's 

CC disease, ulcerative colitis, atopic dermatitis, gout, Grave's disease, 

CC irritable bowel syndrome, lupus erythematosus, multiple sclerosis,

CC myasthenia gravis, osteoarthritis, osteoporosis, rheumatoid arthritis,

CC scleroderma, and autoimmune thyroiditis. Also, complications of cancer,

CC haemodialysis, extracorporeal circulation, infection, and trauma can be 

CC treated using the antagonist. The neuronal disorders that can be treated 

CC include Alzheimer's disease, amnesia, catatonia, amyotrophic lateral 

CC sclerosis, dementia, depression, Down's syndrome, epilepsy, Huntington's 

CC disease, multiple sclerosis, neurofibromatosis, Parkinson's disease, 

CC paranoid psychoses, schizophrenia and Tourette's syndrome
XX 

SQ Sequence 372 AA; 



Query Match 100.0%; Score 2020; DB 2; Length 372;

Best Local Similarity 100.0%; Pred. No. 5.9e-204;

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 2 
AAE36070 

ID AAE36070 standard; protein; 448 AA.
XX
AC AAE36070;
XX
DT 26-JUN-2003 (first entry)
XX
DE Human phosphatidylinositol biphosphate (PIB) protein #5.
XX
KW Human; p53 pathway; phosphatidylinositol biphosphate; PIB; therapeutic;
KW cancer; apoptosis; cell proliferation; cytostatic.
XX



OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 12..326

FT /note= "Inositol polyphosphate phosphatase family,

FT catalytic (IPPC) domain"

XX 

PN WO200299125-A1. 
XX 

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002WO-US017526.
XX

PR 05-JUN-2001; 2001US-0296076P.

PR 10-OCT-2001; 2001US-0328605P.

PR 15-FEB-2002; 2002US-0357253P.
XX 

PA (EXEL-) EXELIXIS INC. 
XX 

PI Friedman L, Plowman GD, Belvin M, Francis-Lang H, Li D, Funke RP;
XX 

DR WPI; 2003-167348/16. 
XX 

PT Identifying candidate p53 pathway-modulating agents, useful as 

PT therapeutic targets for disorders associated with defective p53 function, 

PT comprises screening for agents that modulate phosphatidylinositol 

PT biphosphate activity. 

XX 

PS Claim 13; Page 133-135; 139pp; English. 
XX 

CC The present invention relates to a method of identifying candidate p53 

CC pathway-modulating agents. The method involves screening for agents that 

CC modulate the activity of phosphatidylinositol biphosphate (PIB). The

CC methods are useful for identifying candidate p53 pathway-modulating 

CC agents used as therapeutic targets for disorders associated with 

CC defective p53 function. They are also useful for modulating p53 pathway 

CC in a mammalian cell or for diagnosing or treating a disease associated 

CC with defective p53 function, e.g. cancers such as breast cancer, colon 

CC cancer, lung cancer or ovarian cancer. Sequences of the invention are 

CC useful for identifying and testing agents that modulate PIB function. The 

CC animal models are useful for in vivo assays to test the activity of 

CC candidate p53-modulating agents or to assess the role of PIB in a p53

CC pathway process such as apoptosis or cell proliferation. The present

CC sequence is human PIB protein 

XX 

SQ Sequence 448 AA;

Query Match 100.0%; Score 2020; DB 6; Length 448;

Best Local Similarity 100.0%; Pred. No. 7.8e-204;

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 3 
AAE36067 

ID AAE36067 standard; protein; 478 AA. 
XX 

AC AAE36067; 
XX 

DT 26-JUN-2003 (first entry) 

XX 

DE Human phosphatidylinositol biphosphate (PIB) protein #2. 
XX 

KW Human; p53 pathway; phosphatidylinositol biphosphate; PIB; therapeutic; 

KW cancer; apoptosis; cell proliferation; cytostatic. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Misc-difference 56

FT /label= Unknown

XX

PN WO200299125-A1.
XX 

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002WO-US017526.
XX

PR 05-JUN-2001; 2001US-0296076P.

PR 10-OCT-2001; 2001US-0328605P.

PR 15-FEB-2002; 2002US-0357253P.
XX 

PA (EXEL-) EXELIXIS INC. 
XX 

PI Friedman L, Plowman GD, Belvin M, Francis-Lang H, Li D, Funke RP;
XX 

DR WPI; 2003-167348/16. 



XX 

PT Identifying candidate p53 pathway-modulating agents, useful as 

PT therapeutic targets for disorders associated with defective p53 function, 

PT comprises screening for agents that modulate phosphatidylinositol 

PT biphosphate activity. 

XX 

PS Claim 13; Page 128-129; 139pp; English. 
XX 

CC The present invention relates to a method of identifying candidate p53 

CC pathway-modulating agents. The method involves screening for agents that 

CC modulate the activity of phosphatidylinositol biphosphate (PIB). The

CC methods are useful for identifying candidate p53 pathway-modulating

CC agents used as therapeutic targets for disorders associated with 

CC defective p53 function. They are also useful for modulating p53 pathway 

CC in a mammalian cell or for diagnosing or treating a disease associated 

CC with defective p53 function, e.g. cancers such as breast cancer, colon 

CC cancer, lung cancer or ovarian cancer. Sequences of the invention are 

CC useful for identifying and testing agents that modulate PIB function. The 

CC animal models are useful for in vivo assays to test the activity of 

CC candidate p53-modulating agents or to assess the role of PIB in a p53

CC pathway process such as apoptosis or cell proliferation. The present 

CC sequence is human PIB protein 

XX 

SQ Sequence 478 AA; 

Query Match 99.6%; Score 2012; DB 6; Length 478; 

Best Local Similarity 99.7%; Pred. No. 6e-203; 

Matches 371; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 107 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 166

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 167 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 226

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 227 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 286

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 287 DTSEKKRKPAWTDRILWRLKRQACAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 346

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 347 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 406

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 407 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 466

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 467 REDPLGEAQPQI 478



RESULT 4 
ABO07198 

ID ABO07198 standard; protein; 372 AA. 
XX 

AC ABO07198; 
XX 

DT 13-AUG-2003 (first entry) 
XX 

DE Human p53 modifying protein, SEQ ID 158. 
XX 

KW Human; p53 modifier; cytostatic; cancer; cytostatic; antiangiogenic;

KW antiapoptotic; p53 pathway; breast cancer; colon cancer; kidney cancer;

KW lung cancer; ovarian cancer; angiogenesis; cell cycle;

KW apoptotic disorder; cell proliferation disorder.
XX

OS Homo sapiens.
XX

PN WO200299122-A1.

XX

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002WO-US017382.

XX

PR 05-JUN-2001; 2001US-0296076P.

PR 10-OCT-2001; 2001US-0328605P.

PR 15-FEB-2002; 2002US-0357253P.
XX 

PA (EXEL-) EXELIXIS INC. 

XX 

PI Friedman L, Plowman GD, Belvin M, Francis-Lang H, Li D, Funke RP;
XX

DR WPI; 2003-156859/15.

DR N-PSDB; ACD13373.
XX 

PT Identifying modulators of the p53 pathway for use in treating apoptotic 

PT or cell proliferation disorders, comprises screening for agents that 

PT modulate activity of a human ortholog of genes that modify the p53

PT pathway in Drosophila. 

XX 

PS Example 2; Page 474-475; 678pp; English. 
XX 

CC The invention relates to identifying (M1) a candidate p53 pathway

CC modulating agent, by contacting an assay system comprising a purified HM 

CC polypeptide (human orthologue of genes that modify the p53 pathway in 

CC Drosophila) or nucleic acid with a test agent under conditions, where but 

CC for the presence of the test agent, the system provides a reference 

CC activity, and detecting a test agent-biased activity of the assay system. 

CC Also included are modulating (M2) a p53 pathway of a cell (comprising 

CC contacting a cell defective in p53 function with a candidate modulator 

CC that specifically binds to a HM polypeptide comprising an HM amino acid 

CC sequence, where p53 function is restored), modulating (M3) a p53 pathway 

CC in a mammalian cell (comprising contacting the cell with an agent that 

CC specifically binds an HM polypeptide or nucleic acid) and diagnosing (M4) 

CC a disease in a patient (comprising: (a) obtaining a biological sample 

CC from the patient; (b) contacting the sample with a probe for HM 

CC expression; (c) comparing the results with a control; and (d) determining 



CC whether the comparison indicates a likelihood disease). (M1) is useful

CC for identifying modulators of the p53 pathway. A probe for HM expression 

CC is useful for diagnosing breast, colon, kidney, lung and ovarian cancer, 

CC in a patient, where the cancer has greater than 25 % expression level. 

CC Modulators identified by (M1) are useful in a variety of diagnostic and

CC therapeutic applications, where disease or disorder prognosis is related

CC to defects in the p53 pathway, such as, angiogenesis, apoptotic or cell

CC proliferation disorders (e.g. cancer). Another two new methods (M2 and

CC M3) are useful for modulating the p53 pathway of a cell, thus restoring 

CC the p53 function of the cell, so that the cell undergoes normal 

CC proliferation or progression through the cell cycle. (M2) and (M3) are 

CC also useful for treating defects in the p53 pathway such as angiogenic, 

CC apoptotic or cell proliferation disorders. The present sequence 

CC represents a human p53 pathway modifying protein 
XX 

SQ Sequence 372 AA;

Query Match 99.5%; Score 2010; DB 6; Length 372;

Best Local Similarity 99.5%; Pred. No. 6.7e-203;

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||| ||||||||| ||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSVVGIRRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 5 
AAE36069 

ID AAE36069 standard; protein; 372 AA. 
XX 

AC AAE36069; 
XX 

DT 25-JUN-2003 (first entry) 



DE Human phosphatidylinositol biphosphate (PIB) protein #4.
XX 

KW Human; p53 pathway; phosphatidylinositol biphosphate; PIB; therapeutic; 

KW cancer; apoptosis; cell proliferation; cytostatic. 

XX 

OS Homo sapiens. 
XX 

PN WO200299125-A1. 
XX 

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002WO-US017526.
XX

PR 05-JUN-2001; 2001US-0296076P.

PR 10-OCT-2001; 2001US-0328605P.

PR 15-FEB-2002; 2002US-0357253P.
XX 

PA (EXEL-) EXELIXIS INC. 
XX 

PI Friedman L, Plowman GD, Belvin M, Francis-Lang H, Li D, Funke RP;
XX 

DR WPI; 2003-167348/16. 
XX 

PT Identifying candidate p53 pathway-modulating agents, useful as

PT therapeutic targets for disorders associated with defective p53 function, 

PT comprises screening for agents that modulate phosphatidylinositol 

PT biphosphate activity. 

XX 

PS Claim 13; Page 131-133; 139pp; English. 
XX 

CC The present invention relates to a method of identifying candidate p53 

CC pathway-modulating agents. The method involves screening for agents that

CC modulate the activity of phosphatidylinositol biphosphate (PIB). The

CC methods are useful for identifying candidate p53 pathway-modulating 

CC agents used as therapeutic targets for disorders associated with 

CC defective p53 function. They are also useful for modulating p53 pathway 

CC in a mammalian cell or for diagnosing or treating a disease associated 

CC with defective p53 function, e.g. cancers such as breast cancer, colon 

CC cancer, lung cancer or ovarian cancer. Sequences of the invention are 

CC useful for identifying and testing agents that modulate PIB function. The 

CC animal models are useful for in vivo assays to test the activity of 

CC candidate p53-modulating agents or to assess the role of PIB in a p53

CC pathway process such as apoptosis or cell proliferation. The present 

CC sequence is human PIB protein 

XX 

SQ Sequence 372 AA;

Query Match 99.5%; Score 2010; DB 6; Length 372; 

Best Local Similarity 99.5%; Pred. No. 6.7e-203; 

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 



Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120



Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240 

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGSL 360

       ||||||||||||||||||||||||||||||||||||||| |||||||| ||||||||||

Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSWGIRRPFQIPPGSL 360 

Qy 361 REDPLGEAQPQI 372

       ||||||||||||

Db 361 REDPLGEAQPQI 372



RESULT 6 
AAE36068 

ID AAE36068 standard; protein; 448 AA. 
XX 

AC AAE36068; 
XX 

DT 26-JUN-2003 (first entry)
XX 

DE Human phosphatidylinositol biphosphate (PIB) protein #3.
XX

KW Human; p53 pathway; phosphatidylinositol biphosphate; PIB; therapeutic;

KW cancer; apoptosis; cell proliferation; cytostatic. 

XX 

OS Homo sapiens. 

XX 

PN WO200299125-A1. 
XX 

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002WO-US017526.
XX

PR 05-JUN-2001; 2001US-0296076P.
PR 10-OCT-2001; 2001US-0328605P.
PR 15-FEB-2002; 2002US-0357253P.
XX 

PA (EXEL-) EXELIXIS INC. 
XX 

PI Friedman L, Plowman GD, Belvin M, Francis-Lang H, Li D, Funke RP; 
XX 

DR WPI; 2003-167348/16. 
XX 

PT Identifying candidate p53 pathway-modulating agents, useful as

PT therapeutic targets for disorders associated with defective p53 function, 



PT comprises screening for agents that modulate phosphatidylinositol 

PT biphosphate activity. 

XX 

PS Claim 13; Page 130-131; 139pp; English. 
XX 

CC The present invention relates to a method of identifying candidate p53 

CC pathway-modulating agents. The method involves screening for agents that 

CC modulate the activity of phosphatidylinositol biphosphate (PIB). The

CC methods are useful for identifying candidate p53 pathway-modulating 

CC agents used as therapeutic targets for disorders associated with 

CC defective p53 function. They are also useful for modulating p53 pathway 

CC in a mammalian cell or for diagnosing or treating a disease associated 

CC with defective p53 function, e.g. cancers such as breast cancer, colon 

CC cancer, lung cancer or ovarian cancer. Sequences of the invention are 

CC useful for identifying and testing agents that modulate PIB function. The 

CC animal models are useful for in vivo assays to test the activity of

CC candidate p53-modulating agents or to assess the role of PIB in a p53

CC pathway process such as apoptosis or cell proliferation. The present 

CC sequence is human PIB protein 

XX 

SQ Sequence 448 AA; 

Query Match 99.5%; Score 2010; DB 6; Length 448;

Best Local Similarity 99.5%; Pred. No. 8.9e-203; 

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316 

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGSL 360

       ||||||||||||||||||||||||||||||||||||||| |||||||| ||||||||||

Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSWGIRRPFQIPPGSL 436 

Qy 361 REDPLGEAQPQI 372

       ||||||||||||
Db 437 REDPLGEAQPQI 448



RESULT 7 
ABO53040 



ID ABO53040 standard; protein; 448 AA. 
XX 

AC ABO53040;
XX 

DT 10-OCT-2003 (first entry) 
XX 

DE Human putative spliceosome associated protein (SAP) #16. 
XX 

KW Human; SAP; spliceosome associated protein; ribonucleoprotein; 

KW RNP complex; RNA affinity substrate; RNP assembly sequence; 

KW spliceosomal complex; hnRNP complex; mRNA export complex; 

KW mRNA localisation complex; RNA editing complex; intron complex; 

KW H complex; telomerase complex; fragile X protein complex; 

KW reverse transcriptase complex; gene splicing complex.

XX 

OS Homo sapiens.
XX 

PN US2003068803-A1. 
XX 

PD 10-APR-2003. 
XX 

PF 14-JAN-2002; 2002US-00047991.

XX

PR 12-JAN-2001; 2001US-0261521P.
XX

PA (REED/) REED R.

PA (ZHOU/) ZHOU Z. 
XX 

PI Reed R, Zhou Z; 
XX 

DR WPI; 2003-540885/51. 
XX 

PT Isolating ribonucleoprotein complex, by contacting RNA affinity substrate

PT having ribonucleoprotein assembly sequence and affinity tag, with protein

PT mixture, subjecting complex formed to chromatography, affinity selection.

XX 

PS Claim 24; Page; 39pp; English. 
XX 

CC The invention relates to forming (M1) an isolated ribonucleoprotein (RNP)

CC complex (C), involves contacting an RNA affinity substrate (S) comprising

CC an RNP assembly sequence (AS) and an affinity tag, with a protein mixture 

CC to permit formation of (C) on AS, subjecting (C) to chromatographic 

CC separation, and subjecting (C) to affinity selection, where the affinity 

CC tag (e.g. bacteriophage MS2 coat protein in a fusion protein with E. coli 

CC maltose binding protein) binds to an affinity matrix. Also included are 

CC an isolated spliceosome preparation (isolated by (M1)), a RNA comprising

CC an RNP complex binding site and at least one phage coat protein 

CC recognition site, a nucleic acid encoding the RNA, and treating (M2) a 

CC subject having a disorder associated with abnormal RNP complexes (by 

CC obtaining a sample of cells from a subject, purifying RNP complexes from 

CC the cells of the subject by (M1), determining the presence in the

CC purified RNP complexes of one or more proteins, and normalising the

CC amount of RNPs in the subject. (M1) is useful for forming an isolated RNP

CC complex selected from a spliceosomal complex (selected from E, A, B and C

CC complex), an hnRNP complex, an mRNA export complex, an mRNA localisation

CC complex, an RNA editing complex, an intron complex, or an H complex. (M1)

CC is useful in a diagnostic assay for determining whether a subject has 



CC abnormal RNP complexes, (M2) is useful for treating a subject having a 

CC disorder associated with abnormal RNP complexes. (M1) is useful for

CC forming an isolated RNP complex such as a telomerase complex, a fragile X 

CC protein complex, a reverse transcriptase complex or a gene splicing 

CC complex. The present sequence represents a putative novel human 

CC spliceosome associated protein (SAP) isolated by the methods of the 

CC invention. Note: The present sequence is not shown in the specification

CC but was obtained from Genbank or Swissprot using the information provided 

CC in table 2 of the specification 

XX 

SQ Sequence 448 AA; 

Query Match 99.5%; Score 2010; DB 6; Length 448;

Best Local Similarity 99.5%; Pred. No. 8.9e-203; 

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGSL 360

       ||||||||||||||||||||||||||||||||||||||| |||||||| ||||||||||

Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSWGIRRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372

       ||||||||||||

Db 437 REDPLGEAQPQI 448



RESULT 8 
ADN61442 

ID ADN61442 standard; protein; 240 AA. 
XX 

AC ADN61442;
XX 

DT 12-AUG-2004 (first entry) 
XX 

DE Human KPP-8 protein SEQ ID NO: 8.
XX 

KW human; kinase; phosphatase; enzyme; KPP; cytostatic; 

KW antiarteriosclerotic; anticonvulsant; nootropic; neuroprotective;

KW cerebroprotective; anti-HIV; antiallergic; antiinflammatory;

KW thyromimetic; gene therapy; cell proliferative disorder; cancer;

KW atherosclerosis; neurological disorder; epilepsy; Huntington's disease; 

KW stroke; immune disorder; inflammatory disorder; AIDS; allergy; 

KW developmental disorder; Hypothyroidism; Cushing's syndrome; infection. 

XX 

OS Homo sapiens.
XX

PN WO2004042022-A2.
XX

PD 21-MAY-2004.
XX

PF 30-OCT-2003; 2003WO-US034809.
XX

PR 01-NOV-2002; 2002US-0423226P.

PR 15-NOV-2002; 2002US-0426713P.

PR 26-NOV-2002; 2002US-0429766P.

PR 11-FEB-2003; 2003US-0447043P.
XX 

PA (INCY-) INCYTE CORP. 
XX 

PI Hafalia AJA, Lee S, Murage J, Swarnakar A, Chawla NK, Khare R; 

PI Elliott VS, Tran UK, Ramkumar J, Gururajan R, Baughn MR, Gietzen KJ; 

PI Yang YG, Chien D, Wang JT, Favero KD, Becha SD, Richardson TW; 

PI Jin ?, Hawkins PR, Yue H, Lee EA, Marquis JP;

XX

DR WPI; 2004-390608/36.

DR N-PSDB; ADN61498.
XX 

PT New human kinases and phosphatases (KPP), useful for diagnosing, treating

PT and preventing diseases or conditions associated with the aberrant KPP 

PT expression e.g. cancer, AIDS, epilepsy, or infections. 

XX 

PS Claim 1; SEQ ID NO 8; 320pp; English. 

XX 

CC The present sequence represents a human kinase and phosphatase protein 

CC designated KPP-8. Human KPP sequences have cytostatic, 

CC antiarteriosclerotic, anticonvulsant, nootropic, neuroprotective,

CC cerebroprotective, anti-HIV, antiallergic, antiinflammatory and 

CC thyromimetic activities, and can be used in gene therapy. The human KPP 

CC polypeptides and polynucleotides of the invention are useful in 

CC diagnosing, treating and preventing diseases or conditions associated 

CC with the decreased expression or overexpression of KPP, such as cell

CC proliferative (e.g. cancer, atherosclerosis), neurological (e.g.

CC epilepsy, Huntington's disease, stroke), immune/inflammatory (e.g. AIDS,

CC allergies) and developmental (e.g. Hypothyroidism, Cushing's syndrome) 

CC disorders, or infections. They are also useful in assessing the effects 

CC of exogenous compounds on the expression of nucleic acid and amino acid 

CC sequences of KPP. The KPP sequences or their fragments are useful in 

CC screening compounds for effectiveness as agonist or antagonist of the 

CC polypeptides, or in altering the expression of the target polynucleotide

CC and compounds that specifically bind to or modulate the activity of the 

CC polypeptide. 

XX 

SQ Sequence 240 AA; 

Query Match 50.7%; Score 1025; DB 8; Length 240; 



Best Local Similarity 97.0%; Pred. No. 3.6e-99; 

Matches 191; Conservative 1; Mismatches 5; Indels 0; Gaps 0; 



Qy 176 NSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDH 235 

       | : |   ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 44 NLDIYVIGEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDH 103 

Qy 236 KPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLR 295

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 104 KPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLR 163

Qy 296 DVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355 

       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 164 DVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 223 

Qy 356 PPGSLREDPLGEAQPQI 372 

       |||||||||||||||||

Db 224 PPGSLREDPLGEAQPQI 240



RESULT 9 
AAU98903 

ID AAU98903 standard; protein; 1001 AA. 
XX 

AC AAU98903;
XX 

DT 22-AUG-2002 (first entry) 
XX 

DE Rat inositol polyphosphate 5-phosphatase.
XX

KW Human; inositol polyphosphate 5-phosphatase; asthma; diabetes;

KW chronic obstructive pulmonary disease; cancer; enzyme. 

XX 

OS Rattus norvegicus. 
XX 

PN WO200236755-A2.
XX

PD 10-MAY-2002.
XX

PF 29-OCT-2001; 2001WO-EP012496.
XX

PR 30-OCT-2000; 2000US-0243745P.

PR 26-DEC-2000; 2000US-0257302P.

PR 27-AUG-2001; 2001US-0314660P.
XX 

PA (FARB) BAYER AG.
XX 

PI Kossida S; 
XX 

DR WPI; 2002-490008/52.
XX

PT New human inositol polyphosphate 5-phosphate polypeptide, useful for

PT treating chronic obstructive pulmonary disease, asthma, diabetes or

PT cancer. 
XX 

PS Disclosure; Fig 3; 135pp; English. 
XX 



CC The invention relates to novel purified human inositol polyphosphate 5- 

CC phosphatase polypeptide (I) and the polynucleotide (II) encoding it. (I) 

CC and (II) are useful for screening for agents which decrease the activity 

CC of human inositol polyphosphate 5-phosphatase polypeptide. (I) or (II) 

CC are also useful for screening for agents which modulate the activity of 

CC human inositol polyphosphate 5-phosphatase. A reagent that modulates (I) 

CC or (II) is useful for treating a human inositol polyphosphate 5- 

CC phosphatase dysfunction related disease, such as chronic obstructive 

CC pulmonary disease, asthma, diabetes or cancer. The present sequence 

CC represents rat inositol polyphosphate 5-phosphatase

XX 

SQ Sequence 1001 AA; 

Query Match 43.2%; Score 872.5; DB 5; Length 1001; 

Best Local Similarity 47.5%; Pred. No. 4.2e-82; 

Matches 169; Conservative 56; Mismatches 126; Indels 5; Gaps 4; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

II I I • I • II I I I I I - * I I • I M I I I I : * • : I I III I MM III h: I 
Db 483 MDALGPFNFVLVSTVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 542

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

:h : MINI h I I = = | II = I = I lllllh lllhlll 

Db 543 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAHGILDHDLVFWFGDLNFR 602

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

M • Mlh M = IMIIIhMI hh 1 1 1 1 I I MMM M I 

Db 603 IESYDLHFVKFAIDSNQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 662 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

Ml I I I I I I I I I I U I -I I I I : I I I III I Mllllh 

Db 663 DTSAKKRKPAWTDRILWKVK-APSGGP-SPSGRESHRLQVTQHSYRSHMEYTVSDHKPVA 720

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I h Ihl II Ml -II MMIIIMI I I 

Db 721 ARFLLQFAFRDDVPLVRLEVADEWARPEQAWRYRVETVFARSSWDWIGLYRVGFRHCKD 780 

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355 

MM I M MM : = l MM II h - I - II 1 1 

Db 781 YVAYVWAKHEEV--DGNIYQVTFSEESLPKGHGDFILGYYSHHHSILIGVTEPFQI 834



RESULT 10 
ADR08504 

ID ADR08504 standard; protein; 638 AA. 
XX 

AC ADR08504; 
XX 

DT 04-NOV-2004 (first entry) 
XX 

DE Human protein useful for treating neurological disease Seq 2010. 
XX 

KW human; oligo-capping method; diagnostic marker; gene therapy; 

KW osteoporosis; neurological disease; Alzheimer's disease; 

KW Parkinson's disease; dementia; short memory; cancer; 

KW sense or motor function; emotional reaction; fear response; panic; 

KW osteopathic; neuroprotective; nootropic; antiparkinsonian; cytostatic; 



KW tranquiliser.
XX 

OS Homo sapiens. 
XX 

PN EP1447413-A2.
XX

PD 18-AUG-2004.
XX

PF 12-FEB-2004; 2004EP-00003145.
XX

PR 14-FEB-2003; 2003JP-00102207.

PR 09-MAY-2003; 2003JP-00131452.
XX

PA (REAS-) RES ASSOC BIOTECHNOLOGY.
XX 

PI Isogai T, Yamamoto J, Nishikawa T, Isono Y, Sugiyama T, Otsuki T; 

PI Wakamatsu A, Ishii S, Nagai K, Irie R; 

XX 

DR WPI; 2004-583265/57. 

DR N-PSDB; ADR06548. 
XX 

PT New 1995 cDNA, useful for treating osteoporosis, neurological diseases, 

PT Alzheimer's diseases, Parkinson's diseases, dementia and various cancers. 
XX 

PS Claim 1; SEQ ID NO 2010; 2686pp; English. 

XX 

CC This invention relates to novel, isolated full length human cDNA 

CC molecules and the encoded proteins thereof. Specifically, it refers to

CC cDNA clones obtained by an oligo-capping method, where none of these

CC clones are identical to any known human mRNAs. The present invention

CC describes an immunoassay to identify agonists and antagonists, as well as 

CC antibodies, antisense molecules and siRNAs that can all be used to bind 

CC to and modulate expression of the cDNA molecules. As such, these 

CC molecules are useful for diagnostic markers or therapeutic targets for 

CC the various diseases or morbid states. In particular, they are useful in 

CC gene therapy for treating osteoporosis, neurological disease, Alzheimer's 

CC disease, Parkinson's disease, dementia, short memory and various cancers, 

CC as well as for maintaining equilibrium of sense or motor function, and 

CC for treating emotional reaction, fear response and panic. Accordingly, 

CC they exhibit osteopathic, neuroprotective, nootropic, antiparkinsonian, 

CC cytostatic and tranquiliser activities. This polypeptide is a protein 

CC encoded by a full length human cDNA sequence of the invention. NOTE: This 

CC sequence is not given in the sequence listing of the specification but 

CC can be obtained on CD-ROM from the European Patent Office, Vienna Sub- 

CC office. 

XX 

SQ Sequence 638 AA;

Query Match 43.1%; Score 870.5; DB 8; Length 638;

Best Local Similarity 47.8%; Pred. No. 3.4e-82;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

II I I :h II llll|::|hllll 111 = = = I I IN 111111111- I 
Db 120 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 179



Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120



:h : MINI \ II : = | I i I llihh 1111 = 111 

Db 180 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 239 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

II : lllh H : lllllll-M hh I I I I I I 11 = 111 \ I 

Db 240 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDAGTNKY 299

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III M I I I I I I I I I I I -I I II = 1 II I III I Hllllh 

Db 300 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 357

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299 

I I ' Ihl II :| | : | | |||||||:|| | I 

Db 358 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 417

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355

Ihl I I I II M :hl III- = = lh I I I I 

Db 418 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 471



RESULT 11 
ADB65114 

ID ADB65114 standard; protein; 639 AA.

XX

AC ADB65114;

XX 

DT 04-DEC-2003 (first entry) 
XX 

DE Human protein encoded by clone SPLEN20024930.
XX 

KW Human; pharmaceutical; diagnostic; gene therapy; tissue regeneration; 

KW cell regeneration; membrane protein; signal transduction-related protein; 

KW transcription-related protein; osteoporosis; neurological disease; 

KW cancer; tumour. 

XX 

OS Homo sapiens. 
XX 

PN EP1308459-A2.

XX 

PD 07-MAY-2003. 
XX 

PF 28-MAR-2002; 2002EP-00007401.
XX

PR 05-NOV-2001; 2001JP-00379298.
PR 25-JAN-2002; 2002US-00350978.
XX 

PA (HELI-) HELIX RES INST. 

PA (REAS-) RES ASSOC BIOTECHNOLOGY. 

XX 

PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S; 

PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Irie R, Tamechika I; 

PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y;

XX 

DR WPI; 2003-450961/43. 
DR N-PSDB; ADB63144.
XX 

PT New polynucleotides and polypeptides, useful for developing a diagnostic 



PT marker or medicines for regulation of their expression and activity, or 

PT as targets of gene therapy. 

XX 

PS Claim 1; Page; 222pp; English. 
XX 

CC The invention discloses a polynucleotide comprising a sequence selected 

CC from 1970 fully defined nucleotide sequences which encode novel 

CC polypeptides. Also claimed is a polypeptide encoded by the polynucleotide 

CC or its partial peptide, an antibody binding to the polypeptide or peptide 

CC of the polynucleotide, immunologically assaying the polypeptide or 

CC peptide of the polynucleotide by contacting the polypeptide or peptide 

CC with the antibody of the encoded protein, and observing the binding 

CC between the two, a transformant carrying the polynucleotide in an 

CC expressible manner and an antisense polynucleotide. The oligonucleotide 

CC is useful as a primer for synthesising the polynucleotide, or as a probe 

CC for detecting the polynucleotide. The polynucleotides and encoded 

CC proteins are useful as pharmaceutical agents and many disease-related 

CC genes may be included in them, for developing a diagnostic marker or 

CC medicines for regulation of their expression and activity, or as targets 

CC of gene therapy. The genes are involved in tissue and/or cell 

CC regeneration. Membrane proteins, signal transduction-related proteins, 

CC transcription-related proteins, disease-related proteins and genes 

CC encoding them can be used as indicators for diseases (e.g. osteoporosis, 

CC neurological diseases, cancer, tumours). The cDNA may be used to regulate

CC the activity or expression of the encoded protein to treat diseases. The 

CC sequence presented is a protein of the invention. Note: Some of the 

CC sequence data for this patent is not represented in the printed

CC specification, but is based on sequence information supplied by the 

CC European Patent Office. 

XX 

SQ Sequence 639 AA; 

Query Match 43.0%; Score 868.5; DB 7; Length 639; 

Best Local Similarity 47.8%; Pred. No. 5.6e-82;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

II I I :h II llll|::||:|lll 111 = = = = 1 I III 111111111 = : I 
Db 121 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 180 

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

= h = =11111 h II = = l II =1 =1 111111= lllhlll 

Db 181 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 240 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

II = 1111= =1 = Mlllll-ll hl= II I I I I 11 = 111 =1 I 

Db 241 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 300

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239 

III I Ml I I II ' I III-! I II =| II == I Ml I =111111 = 

Db 301 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 358 

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

11= 11= I I I =11=11 1111111 = 11 I I 

Db 359 AQFLLQFAFRDDMPLWLEVADEWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKD 418 

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355



Ihl I I I II ::| = | = | I I I = = = = 11 = I I I I 

Db 419 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 472



RESULT 12 
ADM04740 

ID ADM04740 standard; protein; 639 AA. 
XX 

AC ADM04740;
XX 

DT 20-MAY-2004 (first entry) 
XX 

DE Human protein of the invention SEQ ID NO: 3425. 
XX 

KW human; gene therapy; diagnostic marker; pharmaceutical. 
XX 

OS Homo sapiens.
XX

PN EP1347046-A1.
XX

PD 24-SEP-2003.
XX

PF 12-APR-2002; 2002EP-00008400.
XX

PR 22-MAR-2002; 2002JP-00137785.

XX

PA (REAS-) RES ASSOC BIOTECHNOLOGY.
XX 

PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S; 

PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Irie R, Tamechika I; 

PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y; 

XX 

DR WPI; 2003-723553/69. 

DR N-PSDB; ADM02297.
XX 

PT New polynucleotides and polypeptides are useful in gene therapy, for 

PT developing a diagnostic marker or medicines for regulating their 

PT expression and activity, or as a target of gene therapy. 

XX 

PS Claim 1; SEQ ID NO 3425; 305pp; English. 
XX 

CC The invention relates to a novel human polynucleotide and the encoded 

CC polypeptide. A polynucleotide of the invention may have a use in gene 

CC therapy. An oligonucleotide of the invention ADM06202-ADM06773 is useful

CC as a primer for synthesizing the polynucleotide or as a probe for 

CC detecting the polynucleotide. The polynucleotides ADM01316-ADM03758 are 

CC useful in gene therapy, for developing a diagnostic marker or medicines 

CC for regulating their expression and activity, or as a target of gene 

CC therapy. The proteins ADM03759-ADM06201 encoded by the polynucleotides

CC are useful as pharmaceutical agents. The present sequence represents a 

CC protein sequence of the invention. 
XX 

SQ Sequence 639 AA;

Query Match 43.0%; Score 868.5; DB 7; Length 639;

Best Local Similarity 47.8%; Pred. No. 5.6e-82;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;



Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

II I I = h II I I I : I = I I I I I I III::: = I I III MINIMI- I 
Db 121 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 180 

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

:h : MUM I II ::| II M M IMMM I I I I : I I I 

Db 181 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 240 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

II : IMM I : IMIMMMI MM 1 1 1 1 I I IMM I I 

Db 241 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 300 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239 

III I I 1 I I I I I II I i N M I II M || :: | Ml | =111111 = 

Db 301 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 358

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

M I h I II H I : I I Mllllhll I I 

Db 359 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 418

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355

11 = 1 I I I II -I :| = | llh= =:|h MM 

Db 419 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 472



RESULT 13 
AAU98904

ID AAU98904 standard; protein; 1006 AA.
XX

AC AAU98904;
XX 

DT 22-AUG-2002 (first entry) 
XX 

DE Human inositol polyphosphate 5-phosphatase #2. 
XX 

KW Human; inositol polyphosphate 5-phosphatase; asthma; diabetes; 

KW chronic obstructive pulmonary disease; cancer; enzyme. 

XX 

OS Homo sapiens. 
XX 

PN WO200236755-A2.
XX

PD 10-MAY-2002.
XX

PF 29-OCT-2001; 2001WO-EP012496.
XX

PR 30-OCT-2000; 2000US-0243745P.
PR 26-DEC-2000; 2000US-0257302P.
PR 27-AUG-2001; 2001US-0314660P.
XX 

PA (FARB) BAYER AG.
XX 

PI Kossida S; 
XX 

DR WPI; 2002-490008/52. 
DR N-PSDB; ABK86166. 



XX 

PT New human inositol polyphosphate 5-phosphate polypeptide, useful for

PT treating chronic obstructive pulmonary disease, asthma, diabetes or 

PT cancer. 
XX 

PS Claim 25; Fig 12; 135pp; English. 
XX 

CC The invention relates to novel purified human inositol polyphosphate 5- 

CC phosphatase polypeptide (I) and the polynucleotide (II) encoding it. (I) 

CC and (II) are useful for screening for agents which decrease the activity 

CC of human inositol polyphosphate 5-phosphatase polypeptide. (I) or (II)

CC are also useful for screening for agents which modulate the activity of

CC human inositol polyphosphate 5-phosphatase. A reagent that modulates (I)

CC or (II) is useful for treating a human inositol polyphosphate 5- 

CC phosphatase dysfunction related disease, such as chronic obstructive 

CC pulmonary disease, asthma, diabetes or cancer. The present sequence 

CC represents human inositol polyphosphate 5-phosphatase

XX 

SQ Sequence 1006 AA; 

Query Match 43.0%; Score 868.5; DB 5; Length 1006; 

Best Local Similarity 47.8%; Pred. No. 1.1e-81;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

II I I = 1 = II Mllh:|hllll llh^ ! I III llllllllh: I 

Db 488 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 547

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

:h : \ \ \ \ \ h II -I II :| I I 1 I ft I = MlhlM 

Db 548 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 607

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLL'rtEFQEGRLLFPPTYKFDRNSNDY 180 

II : Mlh H = llllllhhl hh I I I I I I Ihlll :| I 

Db 608 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 667 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III Ml I I I I II I I I I -I I II I II :: I III I hllllh 

Db 668 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 725

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I h Ihlll :| I = I I Mllllhll I I 

Db 726 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 785

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Ihl I I I II = = | hh hi- hh INI 

Db 786 YVAYVWAKHEDV- -DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 839 
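
The percentages printed with each hit appear to follow simple ratios. This is inferred from the figures in this report, not taken from GenCore documentation. A minimal Python sketch, assuming that Query Match is the hit score divided by the perfect score (2020 for this query) and that Best Local Similarity is the number of identical columns divided by all aligned columns (matches + conservative + mismatches + indels); the function names are hypothetical:

```python
def query_match(score: float, perfect_score: float) -> float:
    """Percent of the perfect (self-alignment) score reached by a hit."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent of aligned columns (indel columns included) that are identical."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures from the 1006 AA hit above: Score 868.5, perfect score 2020,
# Matches 170, Conservative 54, Mismatches 127, Indels 5.
print(query_match(868.5, 2020))                # 43.0
print(best_local_similarity(170, 54, 127, 5))  # 47.8
```

The same two ratios reproduce the values reported for the other hits in this listing (for example 831.5 / 2020 gives 41.2% and 169 / 394 gives 42.9%), which is why the assumption seems reasonable.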



RESULT 14 
ABU54583 

ID ABU54583 standard; protein; 1006 AA. 
XX 

AC ABU54583; 
XX 

DT 03-JUN-2003 (first entry) 
XX 



DE Human NOVX polypeptide #42. 
XX 

KW Human; NOVX; metabolic disorder; cardiomyopathy; diabetes; ASD; 

KW hypertension; congenital heart defect; aortic stenosis; valve disease; 

KW atrial septal defect; atrioventricular canal defect; ductus arteriosus;

KW pulmonary stenosis; subaortic stenosis; ventricular septal defect; VSD; 

KW tuberous sclerosis; scleroderma; atherosclerosis; infectious disease; 

KW obesity; anorexia; neurodegenerative disorder; Alzheimer's disease; 

KW Parkinson's disease; immune disorder; haematopoietic disorder; 

KW haemophilia; hypercoagulation; Crohn's disease; cancer.

XX 

OS Homo sapiens . 
XX 

PN WO200281498-A2 . 
XX 

PD 17-OCT-2002. 
XX 

PF 03-APR-2002; 2002WO-US010780 . 
XX 

PR 03-APR-2001; 2001US-0281086P.
PR 03-APR-2001; 2001US-0281136P.
PR 05-APR-2001; 2001US-0281863P.
PR 05-APR-2001; 2001US-0281906P.
PR 06-APR-2001; 2001US-0282020P.
PR 10-APR-2001; 2001US-0282930P.
PR 10-APR-2001; 2001US-0282934P.
PR 12-APR-2001; 2001US-0283512P.
PR 13-APR-2001; 2001US-0283710P.
PR 17-APR-2001; 2001US-0284234P.
PR 19-APR-2001; 2001US-0285325P.
PR 20-APR-2001; 2001US-0285381P.
PR 20-APR-2001; 2001US-0285609P.
PR 23-APR-2001; 2001US-0285748P.
PR 23-APR-2001; 2001US-0285890P.
PR 24-APR-2001; 2001US-0286068P.
PR 25-APR-2001; 2001US-0286292P.
PR 27-APR-2001; 2001US-0287213P.
PR 02-MAY-2001; 2001US-0288257P.
PR 29-MAY-2001; 2001US-0294164P.
PR 30-MAY-2001; 2001US-0294484P.
PR 18-JUN-2001; 2001US-0298952P.
PR 19-JUN-2001; 2001US-0299237P.
PR 19-JUN-2001; 2001US-0299276P.
PR 12-SEP-2001; 2001US-0318750P.
PR 25-SEP-2001; 2001US-0324800P.
PR 25-SEP-2001; 2001US-0324802P.
PR 27-SEP-2001; 2001US-0325684P.
PR 17-OCT-2001; 2001US-0330143P.
PR 14-NOV-2001; 2001US-0332131P.
PR 14-NOV-2001; 2001US-0332240P.
PR 14-NOV-2001; 2001US-0332779P.
PR 21-NOV-2001; 2001US-0332115P.
PR 04-DEC-2001; 2001US-0337621P.
PR 03-JAN-2002; 2002US-0345783P.
PR 16-JAN-2002; 2002US-0350251P.
PR 02-APR-2002; 2002US-00114270.
XX 



PA (CURA-) CURAGEN CORP. 
XX 

PI Guo X, Kekuda R, Miller CE, Malyankar UM, Spytek KA; 

PI Patturajan M, Liu X, Gusev VY, Li L, Vernet CAM, Zerhusen BD; 

PI Gorman L, Shenoy SG, Pena CEA, Smithson G, Burgess CE, Gerlach V; 

PI Padigaru M, Shimkets RA, Gangolli EA, Taupier RJ, Casman SJ, Ji W; 

PI Anderson DW, Leite MW, Rastelli L, Edinger SR, Stone DJ; 

PI Macdougall JR, Rothenberg ME, Mazur A, Millet I, Peyman JA; 

PI Ellerman K; 

XX 

DR WPI; 2003-046858/04. 

DR N-PSDB; ABX72211. 
XX 

PT New isolated NOVX polypeptide useful for treating atherosclerosis, 

PT metabolic disorders, diabetes, obesity, infectious disease, anorexia, 

PT neurodegenerative disorders, Alzheimer's disease and cancer. 
XX 

PS Claim 1; Page 179-180; 666pp; English. 
XX 

CC The invention relates to human polypeptides, termed NOVX, and the 

CC polynucleotides encoding them. The polypeptides and polynucleotides are 

CC useful for diagnosing disease, and screening for potential therapeutic 

CC agents. The sequences are useful for treating metabolic disorders, 

CC cardiomyopathy, diabetes, hypertension, congenital heart defects, aortic 

CC stenosis, atrial septal defect (ASD), atrioventricular canal defect,

CC ductus arteriosus, pulmonary stenosis, subaortic stenosis, ventricular

CC septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma,

CC atherosclerosis, obesity, infectious disease, anorexia, neurodegenerative 

CC disorders, Alzheimer's disease, Parkinson's disease, immune disorders, 

CC haematopoietic disorders, haemophilia, hypercoagulation, Crohn's disease 

CC and cancer. Sequences ABU54542-ABU54647 represent human NOVX polypeptides

CC of the invention 

XX 

SQ Sequence 1006 AA; 

Query Match 43.0%; Score 868.5; DB 6; Length 1006; 

Best Local Similarity 47.8%; Pred. No. 1.1e-81;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

II I I :h II lllll-lhill III::: I I III IIIIMII|:: I 

Db 488 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 547

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

:|: : MINI I II -I II I :| lllllh IMMM 

Db 548 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 607 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

|| : || (: :| : 1111111 = ^1 I I = I = II II I I I I = I I I = I I 

Db 608 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 667

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III II Mill I I II I -I I II i II :: I III I :|IMI|: 

Db 668 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 725 

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I I : Ih I II H I : I I MINIMI I I 



Db 726 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 785

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Ihl I I I II = = | = 1 = 1 I I I = = = :|h MM 

Db 786 YVAYVWAKHEDV- -DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 839 

RESULT 15 
AAU98902 

ID AAU98902 standard; protein; 751 AA. 
XX 

AC AAU98902; 
XX 

DT 22-AUG-2002 (first entry) 
XX 

DE Human inositol polyphosphate 5-phosphatase #1.
XX 

KW Human; inositol polyphosphate 5-phosphatase; asthma; diabetes; 

KW chronic obstructive pulmonary disease; cancer; enzyme. 

XX 

OS Homo sapiens . 
XX 

PN WO200236755-A2 . 
XX 

PD 10-MAY-2002 . 
XX 

PF 29-OCT-2001; 2001WO-EP012496.
XX

PR 30-OCT-2000; 2000US-0243745P.
PR 26-DEC-2000; 2000US-0257302P.
PR 27-AUG-2001; 2001US-0314660P.

XX 

PA (FARB ) BAYER AG. 
XX 

PI Kossida S; 
XX 

DR WPI; 2002-490008/52. 

DR N-PSDB; ABK86158. 
XX 

PT New human inositol polyphosphate 5-phosphate polypeptide, useful for

PT treating chronic obstructive pulmonary disease, asthma, diabetes or 

PT cancer. 
XX 

PS Claim 25; Fig 2; 135pp; English. 
XX 

CC The invention relates to novel purified human inositol polyphosphate 5- 

CC phosphatase polypeptide (I) and the polynucleotide (II) encoding it. (I) 

CC and (II) are useful for screening for agents which decrease the activity 

CC of human inositol polyphosphate 5-phosphatase polypeptide. (I) or (II) 

CC are also useful for screening for agents which modulate the activity of 

CC human inositol polyphosphate 5-phosphatase. A reagent that modulates (I) 

CC or (II) is useful for treating a human inositol polyphosphate 5- 

CC phosphatase dysfunction related disease, such as chronic obstructive 

CC pulmonary disease, asthma, diabetes or cancer. The present sequence 

CC represents a human inositol polyphosphate 5-phosphatase

XX 

SQ Sequence 751 AA; 



Query Match 41.2%; Score 831.5; DB 5; Length 751; 

Best Local Similarity 42.9%; Pred. No. 5.8e-78; 

Matches 169; Conservative 56; Mismatches 126; Indels 43; Gaps 6; 

Qy 1 MDVLSPLSFIKVSH----------------------------------VRMQGILLLVFA 26

II I I :h hi I I I I I = = 1 I = I I 

Db 195 MDALGPFNFVLVTHPSPPGQPETLLNSWLQLYPGSLWGPLGLCGWVSSVRMQGVILLLFA 254

Qy 27 KYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLE 86 

II Mh:: I I III MINIMI- I :h : MUM h II 
Db 255 KYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRLAAFGHMLCFLNCHLPAHMDKAEQRKD 314

Qy 87 HFDRILEMQNCEGRDIPNILDHD----LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLW 142

:| I I I I lllh M 1111 = 11111 : MM I : II 

Db 315 NFQTILSLQQFQGPGAQGILDHEYGLGLVFWFGDLNFRIESYDLHFVKFAIDSDQLHQLW 374 

Qy 143 EKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQ 202

MIIMMI MM MM I I MMM M MM 1 1 1 1 1 1 1 1 MM- I 

Db 375 EKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKYDTSAKKRKPAWTDRILWKVK-A 433 

Qy 203 PCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPED 261

I M = I II I Ml I MMMM I M lh i I 

Db 434 PGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVAAQFLLQFAFRDDMPLVRLEVAD 492 

Qy 262 LWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVY 321

I Ml : I I II II II hi I I 111 = 1 I I III 

Db 493 SWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKDYVAYVWAKHEDV--DGNTYQVT 550

Qy 322 IDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

-I MM Ml- : = ||: |||| 

Db 551 FSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 584



Search completed: February 10, 2005, 17:49:57 
Job time : 170 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 

OM protein - protein search, using sw model 

Run on: February 10, 2005, 17:50:07 ; Search time 43 Seconds 

(without alignments) 
645.801 Million cell updates/sec 

Title: US-09-892-287-1

Perfect score: 2020

Sequence: 1 MDVLSPLSFIKVSHVRMQGI...FQIPPGSLREDPLGEAQPQI 372

Scoring table: BLOSUM62

Gapop 10.0 , Gapext 0.5

Searched: 513545 seqs, 74649064 residues 
Minimum DB seq length: 0
Post-processing: Minimum Match 0%
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
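
The throughput figure in the run header can be cross-checked from the other header values. A minimal Python sketch, assuming that one "cell update" is one cell of the Smith-Waterman dynamic-programming matrix, so that the total work is the query length multiplied by the number of database residues searched; the function name is hypothetical:

```python
def cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Dynamic-programming cells filled per second for a full database scan."""
    return query_len * db_residues / seconds


# Values from the run header above: 372-residue query, 74649064 database
# residues searched, 43 seconds search time.
rate = cell_updates_per_sec(372, 74_649_064, 43)
print(f"{rate / 1e6:.3f} Million cell updates/sec")  # 645.801
```

Under this assumption the result agrees with the reported 645.801 Million cell updates/sec.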
5, Appli
    24   287    14.2   1149  3  US-09-418-540-5      Sequence 5, Appli
    25   287    14.2   1149  4  US-09-969-528-5      Sequence 5, Appli
    26   284    14.1   1254  4  US-09-949-016-7777   Sequence 7777, Ap
    27   284    14.1   1258  4  US-09-922-543-1      Sequence 1, Appli
    28   263    13.0    654  3  US-08-560-005-10     Sequence 10, Appl
    29   263    13.0    654  3  US-09-418-540-10     Sequence 10, Appl
    30   263    13.0    654  4  US-09-969-528-10     Sequence 10, Appl
    31   252    12.5    968  1  US-08-434-730-14     Sequence 14, Appl
    32   251    12.4   1185  3  US-08-664-962B-2     Sequence 2, Appli
    33   251    12.4   1185  3  US-09-311-743-2      Sequence 2, Appli
    34   250    12.4    976  3  US-08-560-005-2      Sequence 2, Appli
    35   250    12.4    976  3  US-09-195-868-14     Sequence 14, Appl
    36   250    12.4    976  3  US-09-418-540-2      Sequence 2, Appli
    37   250    12.4    976  4  US-09-969-528-2      Sequence 2, Appli
    38   250    12.4   1187  3  US-08-664-962B-8     Sequence 8, Appli
    39   250    12.4   1187  3  US-09-311-743-8      Sequence 8, Appli
    40   250    12.4   1189  3  US-09-195-868-15     Sequence 15, Appl
    41   250    12.4   1229  3  US-09-195-868-28     Sequence 28, Appl
    42   220.5  10.9    398  3  US-08-560-005-3      Sequence 3, Appli
    43   220.5  10.9    398  3  US-09-418-540-3      Sequence 3, Appli
    44   220.5  10.9    398  4  US-09-969-528-3      Sequence 3, Appli
    45   180     8.9    230  4  US-09-270-767-44142  Sequence 44142, A



ALIGNMENTS 



RESULT 1 
US-08-884-681-1

Sequence 1, Application US/08884681 
Patent No. 5955338 
GENERAL INFORMATION: 



APPLICANT: Hillman, Jennifer L.
APPLICANT: Lai, Preeti
APPLICANT: Corley, Neil C.
APPLICANT: Shah, Purvi



TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681

FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO : 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 372 amino acids 



TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRSTNOT03 
CLONE: 638789 
US-08-884-681-1 



Query Match 100.0%; Score 2020; DB 2; Length 372; 

Best Local Similarity 100.0%; Pred. No. 1.3e-208; 

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 2 
US-09-258-643-1 

Sequence 1, Application US/09258643 
Patent No. 6277373
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L. 
APPLICANT: Lai, Preeti 
APPLICANT: Corley, Neil C. 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 



COUNTRY : USA 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/258,643
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681 
FILING DATE: 
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX : 415-845-4166 
INFORMATION FOR SEQ ID NO : 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 372 amino acids
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
IMMEDIATE SOURCE : 

LIBRARY: BRSTNOT03 
CLONE : 638789 
US-09-258-643-1

Query Match 100.0%; Score 2020; DB 3; Length 372; 

Best Local Similarity 100.0%; Pred. No. 1.3e-208; 

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372
       ||||||||||||
Db 361 REDPLGEAQPQI 372



RESULT 3 
US-08-884-681-3 

Sequence 3, Application US/08884681 
Patent No. 5955338 
GENERAL INFORMATION: 



APPLICANT: Hillman, Jennifer L.
APPLICANT: Lai, Preeti
APPLICANT: Corley, Neil C.
APPLICANT: Shah, Purvi

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto
STATE : CA 
COUNTRY : USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681
FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION : 
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO : 3 : 
SEQUENCE CHARACTERISTICS: 
LENGTH: 329 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 1399101 
US-08-884-681-3 

Query Match 88.7%; Score 1791; DB 2; Length 329; 

Best Local Similarity 99.4%; Pred. No. 4.7e-184; 

Matches 326; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy  45 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 104
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 61

Qy 105 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 164
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  62 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 121

Qy 165 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 224
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 122 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 181

Qy 225 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 284
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 182 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 241

Qy 285 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLR 344
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Db 242 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLR 301

Qy 345 SVVGISRPFQIPPGSLREDPLGEAQPQI 372
       ||||| ||||||||||||||||||||||
Db 302 SVVGIRRPFQIPPGSLREDPLGEAQPQI 329



RESULT 4 
US-09-258-643-3
Sequence 3, Application US/09258643 
Patent No. 6277373 
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L.
APPLICANT: Lai, Preeti
APPLICANT: Corley, Neil C.
APPLICANT: Shah, Purvi

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL
NUMBER OF SEQUENCES: 5
CORRESPONDENCE ADDRESS:

ADDRESSEE: Incyte Pharmaceuticals, Inc.

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/258,643

FILING DATE:
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US/08/884,681

FILING DATE: 
ATTORNEY/AGENT INFORMATION : 

NAME: Billings, Lucy J. 



REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO : 3 : 
SEQUENCE CHARACTERISTICS: 
LENGTH: 329 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 1399101 
US-09-258-643-3 

Query Match 88.7%; Score 1791; DB 3; Length 329; 

Best Local Similarity 99.4%; Pred. No. 4.7e-184; 

Matches 326; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy  45 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 104
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 61

Qy 105 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 164
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  62 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 121

Qy 165 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 224
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 122 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 181

Qy 225 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 284
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 182 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 241

Qy 285 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLR 344
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Db 242 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLR 301

Qy 345 SVVGISRPFQIPPGSLREDPLGEAQPQI 372
       ||||| ||||||||||||||||||||||
Db 302 SVVGIRRPFQIPPGSLREDPLGEAQPQI 329



RESULT 5 
US-08-884-681-4 

Sequence 4, Application US/08884681 
Patent No. 5955338 
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L.
APPLICANT: Lai, Preeti
APPLICANT: Corley, Neil C.
APPLICANT: Shah, Purvi

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 



ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681
FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO : 4 : 
SEQUENCE CHARACTERISTICS: 
LENGTH: 942 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE : 
LIBRARY: GenBank 
CLONE: 1019103
US-08-884-681-4

Query Match 21.4%; Score 432.5; DB 2; Length 942; 

Best Local Similarity 37.0%; Pred. No. 4.6e-37; 

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

: II =h || = lh= I =1 II : =: Ih I llllll I : = = 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

-I II II =1 : = I II h =1 I = i I = I = 1 Ihhilh 

Db 425 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 484

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

: h = h : = h III I : I II I I lllhl hhllll 

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244

I I III 11111= I I Ml lllllll Ih 

Db 545 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247

Db 588 GVR 590



RESULT 6 
US-08-560-005-6 

Sequence 6, Application US/08560005 
Patent No. 6001354 
GENERAL INFORMATION: 

APPLICANT: Pot, David A. 
APPLICANT: Williams, Lewis T. 
APPLICANT: Jefferson, Anne Bennett 
APPLICANT: Majerus, Philip W. 

TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic
TITLE OF INVENTION: Acids Encoding Therefor
NUMBER OF SEQUENCES: 10
CORRESPONDENCE ADDRESS:

ADDRESSEE: Townsend and Townsend and Crew
STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
STATE: California 
COUNTRY : USA 
ZIP : 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/560,005 
FILING DATE: 
CLASSIFICATION: 435
ATTORNEY/AGENT INFORMATION: 
NAME: Dow, Karen B. 
REGISTRATION NUMBER: 29,684 
REFERENCE/DOCKET NUMBER: 2307K-0624000
TELECOMMUNICATION INFORMATION : 
TELEPHONE: 415-326-2400 
TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO : 6: 

SEQUENCE CHARACTERISTICS:

LENGTH: 942 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
FEATURE : 

NAME/KEY: Region
LOCATION: 1..942 

OTHER INFORMATION: /note= "majptase" 
US-08-560-005-6

Query Match 21.4%; Score 432.5; DB 3; Length 942; 

Best Local Similarity 37.0%; Pred. No. 4.6e-37; 

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

: II =1= || = ||:= I =1 II : == Ih I MINI I * • = 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424



Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

= :| II M :| : = I II 1= =1 I =11 = 1 = 1 Ihhllh 

Db 425 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWMGDLNYRIEEL 484

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

: l = = |= : = 1 = III I = I II I I Mlhl 1 = 1 = 1111 

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244

I I III 11111= I = = : I III MMMI 11 = 

Db 545 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247 

Db 588 GVR 590 



RESULT 7 
US-09-258-643-4 

Sequence 4, Application US/09258643 
Patent No. 6277373 
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L. 
APPLICANT: Lai, Preeti 
APPLICANT: Corley, Neil C. 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/258,643 
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION : 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO : 4 : 
SEQUENCE CHARACTERISTICS: 
LENGTH: 942 amino acids 



TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY : GenBank 
CLONE: 1019103 
US-09-258-643-4 

Query Match 21.4%; Score 432.5; DB 3; Length 942; 

Best Local Similarity 37.0%; Pred. No. 4.6e-37; 

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

: II =h || = || = = | :| || : =: ||: | Mill | : = = 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

= = l II II H : : I II 1= =1 I =11 = 1 = 1 11 = 1 = 111 = 

Db 425 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 484

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

= h= I = = = I = I I I I = I II I I 1111 = 1 1 = 1 = 1111 

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244

I I III 11111= I : = = I III lllllll Ih 

Db 545 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247

Db 588 GVR 590



RESULT 8 
US-09-418-540-6

Sequence 6, Application US/09418540 
Patent No. 6296848 
GENERAL INFORMATION: 

APPLICANT: Pot, David A. 
APPLICANT: Williams, Lewis T. 
APPLICANT: Jefferson, Anne Bennett 
APPLICANT: Majerus, Philip W. 

TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic
TITLE OF INVENTION: Acids Encoding Therefor 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew 
STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
STATE: California
COUNTRY : USA 
ZIP: 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25



CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/418,540

FILING DATE: 14-OCT-1999 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/560,005 

FILING DATE: 17-NOV-1995
ATTORNEY/AGENT INFORMATION: 

NAME: Dow, Karen B. 

REGISTRATION NUMBER: 29,684 

REFERENCE/DOCKET NUMBER: 2307K-0624000
TELECOMMUNICATION INFORMATION:

TELEPHONE: 415-326-2400

TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO : 6: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 942 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
FEATURE:

NAME/KEY: Region

LOCATION: 1..942

OTHER INFORMATION: /note= "majptase" 
US-09-418-540-6

Query Match 21.4%; Score 432.5; DB 3; Length 942;

Best Local Similarity 37.0%; Pred. No. 4.6e-37; 



Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

: II =h ||:|| = = I M II = := Ih I Mill! I = = : 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

-III II =| : : I Ml: =| I : I h h I I I : I : I I h 

Db 425 WNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWMGDLNYRIEEL 484

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184

= h: 1= = = 1= III I = I II I I lllhl hi HIM 

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544 

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

I I Ml llllh I I III lllllll Ih 

Db 545 KCRAPAWCDRILWKGK NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247 

Db 588 GVR 590 



RESULT 9 
US-09-969-528-6 

; Sequence 6, Application US/09969528 
; Patent No. 6472197 

GENERAL INFORMATION: 

APPLICANT: Pot, David A. 



Williams, Lewis T. 
Jefferson, Anne Bennett 
Majerus, Philip W. 
TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic

Acids Encoding Therefor 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS : 
; ADDRESSEE: Townsend and Townsend and Crew 

STREET: One Market Plaza, Steuart Tower, Suite 2000
; CITY: San Francisco 

STATE: California 
COUNTRY: USA 
ZIP : 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/969,528
FILING DATE: 01-Oct-2001
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/560,005 
FILING DATE: <Unknown> 
ATTORNEY / AGENT INFORMATION: 
NAME: Dow, Karen B. 
; REGISTRATION NUMBER: 29,684 

REFERENCE/DOCKET NUMBER: 2307K- 0624000 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO: 6:
SEQUENCE CHARACTERISTICS: 

LENGTH: 942 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
FEATURE : 

NAME/KEY: Region 
LOCATION: 1..942 

OTHER INFORMATION: /note= "majptase" 
SEQUENCE DESCRIPTION: SEQ ID NO : 6: 
US-09-969-528-6 

Query Match 21.4%; Score 432.5; DB 4; Length 942;

Best Local Similarity 37.0%; Pred. No. 4.6e-37; 

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

: II :|= Ihll- | =| || : :: ||: I Mill I : = : 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

-I II II =1 = = I II h =1 I :|hhl Ihhllh 

Db 425 WNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWMGDLNYRIEEL 484 



Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244

Db 545 KCRAPAWCDRILWKGK NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247

Db 588 GVR 590



RESULT 10 
US-08-560-005-7

; Sequence 7, Application US/08560005 

; Patent No. 6001354 

; GENERAL INFORMATION: 

APPLICANT: Pot, David A. 

APPLICANT: Williams, Lewis T. 
; APPLICANT: Jefferson, Anne Bennett 

APPLICANT: Majerus, Philip W. 

TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic
TITLE OF INVENTION: Acids Encoding Therefor 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Townsend and Townsend and Crew

STREET: One Market Plaza, Steuart Tower, Suite 2000 

CITY: San Francisco 

STATE: California 

COUNTRY : USA 

ZIP: 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/560,005

FILING DATE: 

CLASSIFICATION: 435 
ATTORNEY /AGENT INFORMATION: 
; NAME: Dow, Karen B.

REGISTRATION NUMBER: 29,684

REFERENCE/DOCKET NUMBER: 2307K-0624000
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 

TELEFAX: 415-326-2422 
; INFORMATION FOR SEQ ID NO: 7: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 968 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
FEATURE : 

NAME/KEY: Region 

LOCATION: 1..968



OTHER INFORMATION: /note= "ocrl" 
US-08-560-005-7 



Query Match 18.2%; Score 368.5; DB 3; Length 968; 

Best Local Similarity 29.1%; Pred. No. 3.8e-30; 

Matches 102; Conservative 55; Mismatches 121; Indels 73; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64 

I : II Ih |::||:|h Ih -I- !h I llllll = 

Db 382 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 441

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDI PNILDHD 109

hi II h = II =1 = Hi Ih h 

Db 442 TTFCIVNSHLAAHVED LERRNQDYKDICARMSFWPNQTLPQLNIMKHE 490

Qy 110 LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPP 169 

::|| Ihhh = h I : I = Mhl : =1 M = I I 

Db 491 WIWMGDLNYRLCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIP 550

Qy 170 TYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMT 229 

Mhl = | = | I I III llllll = = = I III 

Db 551 TYKYDSKTDRWDSSGKCRVPAWCDRILWR GTNVNQLNYRSHML 593

Qy 230 YGISDHKPVSGTFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPS 281

lllllll I : :| = I : =1 : I I I = 1 I : I 

Db 594 LKTSDHKPVSALFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF-- 647

Qy 282 SPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I : I : : I hi II I II I 

Db 648 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 682



RESULT 11
US-09-418-540-7

Sequence 7, Application US/09418540 
Patent No. 6296848 
GENERAL INFORMATION : 

APPLICANT: Pot, David A.
APPLICANT: Williams, Lewis T. 
APPLICANT: Jefferson, Anne Bennett 
APPLICANT: Majerus, Philip W. 

TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic
TITLE OF INVENTION: Acids Encoding Therefor 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew 
STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
STATE: California 
COUNTRY : USA 
ZIP: 94105
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 



APPLICATION NUMBER: US/09/418,540

FILING DATE: 14-OCT-1999

CLASSIFICATION:
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/560,005 

FILING DATE: 17-NOV-1995
ATTORNEY/ AGENT INFORMATION: 

NAME: Dow, Karen B. 

REGISTRATION NUMBER: 29,684 

REFERENCE/DOCKET NUMBER: 2307K-0624000
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 

TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO : 7 : 
SEQUENCE CHARACTERISTICS: 

LENGTH: 968 amino acids 

TYPE: amino acid

TOPOLOGY: linear
MOLECULE TYPE: protein
FEATURE:

NAME/KEY: Region

LOCATION: 1..968

OTHER INFORMATION: /note= "ocrl" 
US-09-418-540-7 

Query Match 18.2%; Score 368.5; DB 3; Length 968;

Best Local Similarity 29.1%; Pred. No. 3.8e-30;

Matches 102; Conservative 55; Mismatches 121; Indels 73; Gaps 8;

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64

I : II lh |: = lhlh Ih =:|== Ih I MINI : 

Db 382 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 441

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDI PNILDHD 109

hi II h : II =1 : :|| Ih h 

Db 442 TTFCIVNSHLAAHVED LERRNQDYKDICARMSFWPNQTLPQLNIMKHE 490

Qy 110 LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPP 169

-II Ihhh : h I : ! : Mhl : • I II ' I I 

Db 491 WIWMGDLNYRLCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIP 550 

Qy 170 TYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMT 229

Mhl : = MM I I III MIMI I III 

Db 551 TYKYDSKTDRWDSSGKCRVPAWCDRILWR GTNVNQLNYRSHML 593 

Qy 230 YGISDHKPVSGTFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS -STSDFPS 281 

Mill | : M : I : M MM : I I M 

Db 594 LKTSDHKPVSALFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF-- 647

Qy 282 SPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I : I = = I hi II III I 

Db 648 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 682



RESULT 12 
US-09-969-528-7

; Sequence 7, Application US/09969528
; Patent No. 6472197

GENERAL INFORMATION: 

APPLICANT: Pot, David A. 

Williams, Lewis T. 
; Jefferson, Anne Bennett 

Majerus, Philip W. 
TITLE OF INVENTION: Novel Grb2 Associating Protein and Nucleic
; Acids Encoding Therefor 

NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Townsend and Townsend and Crew 

STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
STATE : California 
COUNTRY: USA 
ZIP : 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/969,528
FILING DATE: 01-Oct-2001
CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/560,005
FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: Dow, Karen B.
REGISTRATION NUMBER: 29,684
REFERENCE/DOCKET NUMBER: 2307K-0624000
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO : 7 : 
SEQUENCE CHARACTERISTICS: 

LENGTH: 968 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
FEATURE : 

NAME/KEY: Region
LOCATION: 1..968

OTHER INFORMATION: /note= "ocrl" 
SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
US-09-969-528-7 

Query Match 18.2%; Score 368.5; DB 4; Length 968; 

Best Local Similarity 29.1%; Pred. No. 3.8e-30; 

Matches 102; Conservative 55; Mismatches 121; Indels 73; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64 

I : II lh h:||:|h Ih = = |=: Ih I MINI = 

Db 382 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 441



Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDI PNILDHD 109

Db 442 TTFCIVNSHLAAHVED LERRNQDYKDICARMSFWPNQTLPQLNIMKHE 490

Qy 110 LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPP 169

::|| Ihhh : h I = I = Mhl = = 111 = 11

Db 491 WIWMGDLNYRLCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIP 550

Qy 170 TYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMT 229

Db 551 TYKYDSKTDRWDSSGKCRVPAWCDRILWR GTNVNQLNYRSHML 593

Qy 230 YGISDHKPVSGTFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPS 281

Illllll I : =1 = I = =1 'III = 1 I : I

Db 594 LKTSDHKPVSALFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF-- 647

Qy 282 SPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I = I : : I hi II I II I

Db 648 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 682



RESULT 13 
US-08-884-681-5 

; Sequence 5, Application US/08884681 

; Patent No. 5955338 

; GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L. 

APPLICANT: Lai, Preeti 

APPLICANT: Corley, Neil C. 

APPLICANT: Shah, Purvi 

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL 
NUMBER OF SEQUENCES : 5 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

; STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681

FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO : 5: 



SEQUENCE CHARACTERISTICS: 
; LENGTH: 901 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 1420920
US-08-884-681-5 



Query Match 18.2%; Score 367.5; DB 2; Length 901; 

Best Local Similarity 29.4%; Pred. No. 4.3e-30;

Matches 100; Conservative 58; Mismatches 131; Indels 51; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64 

I : II lh l=:|| = ll= lh : = l = = lh I MINI = 

Db 307 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 366

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQN--CEGRDIP--NILDHDLIIWFGDMNFR 120



Db 367 TTFCIVNSHLAAHVEDFERRNQDYKDICARMSFWPNQTLPQLNIMKHEWIWLGDLNYR 426



Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

: |= | = | : |||:| : :| || = | MM =: : 

Db 427 LCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRW 486 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

hi I I III MINI I III MINN 

Db 487 DSSGKCRVPAWCDRILWR GTNVNQLNYRSHMELKTSDHKPVSA 529



Qy 241 TFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPSSPWDWIGLYKV 292

I : :| = ! : =| -III : I I = I 
Db 530 LFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF 572



Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332 

h h : I hi II I II I 

Db 573 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 607 



RESULT 14 
US-09-258-643-5 

; Sequence 5, Application US/09258643 

; Patent No. 6277373 

; GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L.

APPLICANT: Lai, Preeti

APPLICANT: Corley, Neil C.

APPLICANT: Shah, Purvi

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
; STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/258,643
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,681
FILING DATE: 
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 5:
SEQUENCE CHARACTERISTICS: 
LENGTH: 901 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear
IMMEDIATE SOURCE:
LIBRARY: GenBank
CLONE: 1420920
US-09-258-643-5 

Query Match 18.2%; Score 367.5; DB 3; Length 901; 

Best Local Similarity 29.4%; Pred. No. 4.3e-30; 

Matches 100; Conservative 58; Mismatches 131; Indels 51; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64

I = II Ih l= = ll = ll= Ih =:h= Ih I MINI = 

Db 307 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 366

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQN--CEGRDIP--NILDHDLIIWFGDMNFR 120

|:| || |: : :| : : | : : =| ||: |:::|| ||:|:| 

Db 367 TTFCIVNSHLAAHVEDFERRNQDYKDICARMSFWPNQTLPQLNIMKHEWIWLGDLNYR 426 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

: |: | : | : |||:| : =| || = | ||||:| =: : 

Db 427 LCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRW 486

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

hi I I III MM ::: I III IIIIIM 

Db 487 DSSGKCRVPAWCDRILWR GTNVNQLNYRSHMELKTSDHKPVSA 529 

Qy 241 TFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPSSPWDWIGLYKV 292 

I : =| = I = =| :||| = I I :| 
Db 530 LFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF 572 

Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I : I : : I hi II I II I 

Db 573 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 607



RESULT 15 

US-09-949-016-7283 

Sequence 7283, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al.

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 
TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 
THEREOF 

FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016
CURRENT FILING DATE: 2000-04-14
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,760 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7283 
LENGTH: 952 
TYPE : PRT 
ORGANISM: Human 
US-09-949-016-7283 

Query Match 18.2%; Score 367.5; DB 4; Length 952; 

Best Local Similarity 29.4%; Pred. No. 4.7e-30; 

Matches 100; Conservative 58; Mismatches 131; Indels 51; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64

I = II Ih l=:|| = ll= Ih : = ! = = Ih I llllll = 

Db 366 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 425

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQN--CEGRDIP--NILDHDLIIWFGDMNFR 120

hi II h : =1 : : I = = =1 Ih | = = :|| Ihhl 

Db 426 TTFCIVNSHLAAHVEDFERRNQDYKDICARMSFWPNQTLPQLNIMKHEWIWLGDLNYR 485

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

= 1=1= I = 111 = 1 = =111 = 1 lllhl == = 

Db 486 LCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRW 545

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

hi I I III llllll I III I I I I I 

Db 546 DSSGKCRVPAWCDRILWR GTNVNQLNYRSHMELKTSDHKPVSA 588

Qy 241 TFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPSSPWDWIGLYKV 292

I = =1 = I = =1 =111 = 1 I =1 
Db 589 LFHIGVKWDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF 631

Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I = I = = I hi II I II I 

Db 632 VFENVKFRQLQKEKFQISNN-GQVPCHFSFIPKLND 666



Search completed: February 10, 2005, 18:03:02 



Job time : 45 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:         February 10, 2005, 17:50:47 ; Search time 133 Seconds
                (without alignments)
                913.915 Million cell updates/sec

Title:          US-09-892-287-1
Perfect score:  2020
Sequence:       1 MDVLSPLSFIKVSHVRMQGI FQIPPGSLREDPLGEAQPQI 372

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1376875 seqs, 326749119 residues

Total number of hits satisfying chosen parameters: 1376875

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
Listing first 45 summaries 



Database : Published_Applications_AA:*
1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10D_PUBCOMB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US11_NEW_PUB.pep:*
19: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
20: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
US-09-892-287-1

; Sequence 1, Application US/09892287 



; Patent No. US20020127217A1 
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L. 
Lai, Preeti 
; Corley, Neil C. 

Shah, Purvi 

TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL

(4,5) BISPHOSPHATE 5-PHOSPHATASE
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
; ZIP : 94304 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/892,287

FILING DATE: 26-Jun-2001
PRIOR APPLICATION DATA:

APPLICATION NUMBER: 09/258,643 

FILING DATE: <Unknown> 
ATTORNEY /AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0334 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 1:
SEQUENCE CHARACTERISTICS: 

LENGTH: 372 amino acids 
; TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRSTNOT03 

CLONE: 638789 
SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
US-09-892-287-1

Query Match 100.0%; Score 2020; DB 9; Length 372; 

Best Local Similarity 100.0%; Pred. No. 2.2e-188; 

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372

||||||||||||

Db 361 REDPLGEAQPQI 372



RESULT 2 

US-10-161-510-14 

; Sequence 14, Application US/10161510 

; Publication No. US20020192695A1

; GENERAL INFORMATION: 

; APPLICANT: EXELIXIS, INC. 

; TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE 
; FILE REFERENCE: EX02-074C

; CURRENT APPLICATION NUMBER: US/10/161,510 

CURRENT FILING DATE: 2002-06-03 

PRIOR APPLICATION NUMBER: US 60/296,076 
; PRIOR FILING DATE: 2001-06-05 
; PRIOR APPLICATION NUMBER: US 60/328,605

PRIOR FILING DATE: 2001-10-10 
; PRIOR APPLICATION NUMBER: US 60/357,253 
; PRIOR FILING DATE: 2002-02-15 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 14
LENGTH: 448
TYPE: PRT

ORGANISM: Homo sapiens 
US-10-161-510-14 

Query Match 100.0%; Score 2020; DB 13; Length 448; 

Best Local Similarity 100.0%; Pred. No. 2.8e-188; 

Matches 372; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372

||||||||||||

Db 437 REDPLGEAQPQI 448


RESULT 3 

US-10-161-510-11 

Sequence 11, Application US/10161510 
Publication No. US20020192695A1
GENERAL INFORMATION: 
APPLICANT: EXELIXIS, INC. 

TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE 
FILE REFERENCE: EX02-074C 

CURRENT APPLICATION NUMBER: US/10/161,510
CURRENT FILING DATE: 2002-06-03 
PRIOR APPLICATION NUMBER: US 60/296,076 

PRIOR FILING DATE: 2001-06-05

PRIOR APPLICATION NUMBER: US 60/328,605
PRIOR FILING DATE: 2001-10-10 
PRIOR APPLICATION NUMBER: US 60/357,253 
PRIOR FILING DATE: 2002-02-15 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: PatentIn version 3.1
SEQ ID NO 11
LENGTH: 478
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY: misc_feature
LOCATION: (56)..(56)

OTHER INFORMATION: "X" is any amino acid
US-10-161-510-11 

Query Match 99.6%; Score 2012; DB 13; Length 478; 

Best Local Similarity 99.7%; Pred. No. 1.9e-187; 

Matches 371; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 107 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 166

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 167 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 226

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 227 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 286

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

|||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||

Db 287 DTSEKKRKPAWTDRILWRLKRQACAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 346

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 347 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 406

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 407 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 466

Qy 361 REDPLGEAQPQI 372

||||||||||||

Db 467 REDPLGEAQPQI 478



RESULT 4 

US-10-161-510-13 

; Sequence 13, Application US/10161510 

; Publication No. US20020192695A1

; GENERAL INFORMATION:

; APPLICANT: EXELIXIS, INC.

; TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE 
FILE REFERENCE: EX02-074C 

CURRENT APPLICATION NUMBER: US/10/161,510

CURRENT FILING DATE: 2002-06-03 
; PRIOR APPLICATION NUMBER: US 60/296,076 

PRIOR FILING DATE: 2001-06-05 

PRIOR APPLICATION NUMBER: US 60/328,605
; PRIOR FILING DATE: 2001-10-10 
; PRIOR APPLICATION NUMBER: US 60/357,253 
; PRIOR FILING DATE: 2002-02-15 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 13
LENGTH: 372
TYPE: PRT
; ORGANISM: Homo sapiens 
US-10-161-510-13 

Query Match 99.5%; Score 2010; DB 13; Length 372; 

Best Local Similarity 99.5%; Pred. No. 2.1e-187; 

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

||||||||||||||||||||||||||||||||||||||| ||||||||| ||||||||||

Db 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSVVGIRRPFQIPPGSL 360

Qy 361 REDPLGEAQPQI 372

||||||||||||

Db 361 REDPLGEAQPQI 372



RESULT 5 

US-10-161-510-12 

Sequence 12, Application US/10161510 
Publication No. US20020192695A1
GENERAL INFORMATION: 
APPLICANT: EXELIXIS, INC. 

TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE
FILE REFERENCE: EX02-074C 

CURRENT APPLICATION NUMBER: US/10/161,510
CURRENT FILING DATE: 2002-06-03 
PRIOR APPLICATION NUMBER: US 60/296,076 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/328,605
PRIOR FILING DATE: 2001-10-10 
PRIOR APPLICATION NUMBER: US 60/357,253 
PRIOR FILING DATE: 2002-02-15 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: PatentIn version 3.1
SEQ ID NO 12
LENGTH: 448
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-161-510-12 

Query Match 99.5%; Score 2010; DB 13; Length 448; 

Best Local Similarity 99.5%; Pred. No. 2.7e-187; 

Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0;
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy 301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSL 360

||||||||||||||||||||||||||||||||||||||| ||||||||| ||||||||||

Db 377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSVVGIRRPFQIPPGSL 436

Qy 361 REDPLGEAQPQI 372

||||||||||||

Db 437 REDPLGEAQPQI 448



RESULT 6 
US-09-892-287-3 

; Sequence 3, Application US/09892287 
; Patent No. US20020127217A1 
GENERAL INFORMATION: 

APPLICANT: Hillman, Jennifer L.
; Lai, Preeti
Corley, Neil C.
; Shah, Purvi
TITLE OF INVENTION: NEW HUMAN PHOSPHATIDYLINOSITOL
(4,5) BISPHOSPHATE 5-PHOSPHATASE
NUMBER OF SEQUENCES: 5
CORRESPONDENCE ADDRESS:
; ADDRESSEE: Incyte Pharmaceuticals, Inc.
; STREET: 3174 Porter Drive
CITY: Palo Alto
STATE: CA
COUNTRY: USA
ZIP: 94304
COMPUTER READABLE FORM:
MEDIUM TYPE: Diskette
COMPUTER: IBM Compatible
OPERATING SYSTEM: DOS
SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
APPLICATION NUMBER: US/09/892,287
FILING DATE: 26-Jun-2001
PRIOR APPLICATION DATA:
APPLICATION NUMBER: 09/258,643
FILING DATE: <Unknown>
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0334 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 3:
SEQUENCE CHARACTERISTICS:
LENGTH: 329 amino acids
TYPE: amino acid
STRANDEDNESS: single
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 
CLONE: 1399101 
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-09-892-287-3 

Query Match 88.7%; Score 1791; DB 9; Length 329; 

Best Local Similarity 99.4%; Pred. No. 4.3e-166; 

Matches 326; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy   45 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 104
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    2 GLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 61

Qy  105 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 164
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   62 ILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGR 121

Qy  165 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 224
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  122 LLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGY 181

Qy  225 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 284
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  182 SSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPW 241

Qy  285 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLR 344
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Db  242 DWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLR 301

Qy  345 SVVGISRPFQIPPGSLREDPLGEAQPQI 372
        ||||| ||||||||||||||||||||||
Db  302 SVVGIRRPFQIPPGSLREDPLGEAQPQI 329



RESULT 7 
US-10-415-147-3 

; Sequence 3, Application US/10415147 
; Publication No. US20040043399A1 
; GENERAL INFORMATION: 
; APPLICANT: Bayer AG 

TITLE OF INVENTION: REGULATION OF HUMAN INOSITOL POLYPHOSPHATE 5-PHOSPHATASE
; FILE REFERENCE: LI0235Foreign Countries

CURRENT APPLICATION NUMBER: US/10/415,147

CURRENT FILING DATE: 2003-04-25

PRIOR APPLICATION NUMBER: US 60/243,745
PRIOR FILING DATE: 2000-10-30
PRIOR APPLICATION NUMBER: US 60/257,302
PRIOR FILING DATE: 2000-12-26
PRIOR APPLICATION NUMBER: US 60/314,660
PRIOR FILING DATE: 2001-08-27
NUMBER OF SEQ ID NOS: 12
SOFTWARE: PatentIn version 3.1
SEQ ID NO 3
LENGTH: 1001
TYPE: PRT

ORGANISM: Rattus norvegicus
US-10-415-147-3

Query Match 43.2%; Score 872.5; DB 15; Length 1001;

Best Local Similarity 47.5%; Pred. No. 1.1e-75;

Matches 169; Conservative 56; Mismatches 126; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Il I I = h II ll!l|::||:|lll Ilh = = : I I III 11111111- I 
Db 483 MDALGPFNFVLVSTVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 542

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

:h : MINI h II -I II :| I MMIh IIIMM 

Db 543 AAFGHMLCFLNCHLPAHMDKAKQRKDNFQTILSLQQFQGPGAHGILDHDLVFWFGDLNFR 602

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

II : lllh I : llllll-M hh I I I I I I Ihlll I I 

Db 603 IESYDLHFVKFAIDSNQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 662 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III I M I I I M I I M I -I | || I || :: | I | =111111 = 

Db 663 DTSAKKRKPAWTDRILWKVK-APSGGP-SPSGRESHRLQVTQHSYRSHMEYTVSDHKPVA 720

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I h Ihlll :| | : | | llt||||=|| | | 

Db 721 ARFLLQFAFRDDVPLVRLEVADEWARPEQAVVRYRVETVFARSSWDWIGLYRVGFRHCKD 780

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

I hi I =| h II = = | : hi llh :: \ :: M I I 

Db 781 YVAYVWAKHEEV--DGNIYQVTFSEESLPKGHGDFILGYYSHHHSILIGVTEPFQI 834



RESULT 8 

US-10-104-047-3268 

; Sequence 3268, Application US/10104047 

; Publication No. US20030236392A1 

; GENERAL INFORMATION: 

; APPLICANT: HELIX RESEARCH INSTITUTE 

; TITLE OF INVENTION: Novel full length cDNA

; FILE REFERENCE: H1-A0105 

; CURRENT APPLICATION NUMBER: US/10/104,047

; CURRENT FILING DATE: 2002-03-25 

; PRIOR APPLICATION NUMBER: 

; PRIOR FILING DATE: 

; NUMBER OF SEQ ID NOS: 4096 

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 3268
LENGTH: 639
TYPE: PRT

ORGANISM: Homo sapiens 
US-10-104-047-3268 

Query Match 43.0%; Score 868.5; DB 15; Length 639; 

Best Local Similarity 47.8%; Pred. No. 1.4e-75; 

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

II I I :h II IMI|::|hlMI 111= = = = I I Ml 111111111 = = I 
Db 121 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 180 

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

I : : | | | II I I ::| II I I Illllh ! I I I = I M 

Db 181 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 240 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

II : IMh :| : 1 I | E I i i = = E I 1 = 1= MM I I I hill =11 

Db 241 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 300

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III MM II I 111111 = = I I M =1 II = = I III I Mill Ih 

Db 301 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 358

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I h Ihl II Ml = I I II II M h II ! I 

Db 359 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKD 418

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Ihl I I I II : = | =1 = 1 Ml- -Ih MM 

Db 419 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 472



RESULT 9 

US-10-108-260A-3425

; Sequence 3425, Application US/10108260A
; Publication No. US20040005560A1
; GENERAL INFORMATION:

; APPLICANT: HELIX RESEARCH INSTITUTE

; TITLE OF INVENTION: Novel full length cDNA
; FILE REFERENCE: H1-A0106

; CURRENT APPLICATION NUMBER: US/10/108,260A
; CURRENT FILING DATE: 2002-03-27
; NUMBER OF SEQ ID NOS: 5458
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 3425

LENGTH: 639

TYPE: PRT
; ORGANISM: Homo sapiens
US-10-108-260A-3425

Query Match 43.0%; Score 868.5; DB 15; Length 639;

Best Local Similarity 47.8%; Pred. No. 1.4e-75;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;



Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

II I I = 1 = II I I I ll-ihll I I Ml:'-: : | | ||| ||.|| || || :: | 
Db 121 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 180



Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

:h : MINI h II = :| I :| :| Illllh lllhlll 

Db 181 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 240

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

II : lllh I : Mlllll-ll hh I I I I I I 11 = 111 H I 

Db 241 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 300

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239 

III I I II i II I I I I I I -I I II :| II I III I Hllllh 

Db 301 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 358 

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299

I h Ih I I I H I = I I t I I 1 II 1 = I I I I 

Db 359 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKD 418

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Ihl I I I II -I :hl III- -Ih I I I I 

Db 419 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 472



RESULT 10 
US-10-114-270-84

Sequence 84, Application US/10114270 
Publication No. US20040030110A1 
GENERAL INFORMATION: 



APPLICANT: Guo, Xiaojia
APPLICANT: Kekuda, Ramesh
APPLICANT: Miller, Charles E.
APPLICANT: Malyankar, Uriel M.
APPLICANT: Spytek, Kimberly A.
APPLICANT: Patturajan, Meera
APPLICANT: Liu, Ziaohong
APPLICANT: Gusev, Vladimir Y.
APPLICANT: Li, Li
APPLICANT: Vernet, Corine
APPLICANT: Zerhusen, Bryan D.
APPLICANT: Gorman, Linda
APPLICANT: Shenoy, Suresh G.
APPLICANT: Pena, Carol E.A.
APPLICANT: Smithson, Glennda
APPLICANT: Burgess, Catherine E.
APPLICANT: Gerlach, Valerie
APPLICANT: Padigaru, Muralidhara
APPLICANT: Shimkets, Richard A.
APPLICANT: Gangolli, Esha A.
APPLICANT: Taupier Jr., Raymond J.
APPLICANT: Casman, Stacie J.
APPLICANT: Ji, Weizhen
APPLICANT: Anderson, David W.
APPLICANT: Liete, Mario W.
APPLICANT: Rastelli, Luca
APPLICANT: Edinger, Shlomit R.
APPLICANT: Stone, David J.
APPLICANT: MacDougall, John R.
APPLICANT: Rothenberg, Mark E.

TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same

FILE REFERENCE: 21402-322C

CURRENT APPLICATION NUMBER: US/10/114,270
CURRENT FILING DATE: 2002-11-27 



PRIOR APPLICATION NUMBER: 60/281,086
PRIOR FILING DATE: 2001-04-03
PRIOR APPLICATION NUMBER: 60/281,136
PRIOR FILING DATE: 2001-04-03
PRIOR APPLICATION NUMBER: 60/281,863
PRIOR FILING DATE: 2001-04-05
PRIOR APPLICATION NUMBER: 60/281,906
PRIOR FILING DATE: 2001-04-05
PRIOR APPLICATION NUMBER: 60/282,020
PRIOR FILING DATE: 2001-04-06
PRIOR APPLICATION NUMBER: 60/282,930
PRIOR FILING DATE: 2001-04-10
PRIOR APPLICATION NUMBER: 60/282,934
PRIOR FILING DATE: 2001-04-10
PRIOR APPLICATION NUMBER: 60/283,512
PRIOR FILING DATE: 2001-04-12
PRIOR APPLICATION NUMBER: 60/283,710
PRIOR FILING DATE: 2001-04-13
PRIOR APPLICATION NUMBER: 60/284,234
PRIOR FILING DATE: 2001-04-17



Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 470
SEQ ID NO 84
LENGTH: 1006 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-114-270-84 

Query Match 43.0%; Score 868.5; DB 15; Length 1006; 

Best Local Similarity 47.8%; Pred. No. 2.7e-75; 

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

Ml I :h II llllh:|hl!ll III::: : I I III MINIMI:: I 
Db 488 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 547

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120 

:|: : MUM h II : = | II I M lllll|: MlhlM 

Db 548 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 607 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

II : lllh =| : |||lll|::|l I = I = II I I I I II = I I I Ml 

Db 608 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 667

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239 

III I I II I II I II I II : M I II M M :: I Ml I Mllllh 

Db 668 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 725 



Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299



Ih Ihl II :| I : I I llllllhll I I 

Db 726 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKD 785 

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Ihl ! I I II = = | :hl 111== = = lh I I I I 

Db 786 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 839



RESULT 11 
US-10-415-147-12 

Sequence 12, Application US/10415147 
Publication No. US20040043399A1
GENERAL INFORMATION:
APPLICANT: Bayer AG

TITLE OF INVENTION: REGULATION OF HUMAN INOSITOL POLYPHOSPHATE 5-PHOSPHATASE
FILE REFERENCE: LI0235Foreign Countries
CURRENT APPLICATION NUMBER: US/10/415,147
CURRENT FILING DATE: 2003-04-25
PRIOR APPLICATION NUMBER: US 60/243,745
PRIOR FILING DATE: 2000-10-30
PRIOR APPLICATION NUMBER: US 60/257,302
PRIOR FILING DATE: 2000-12-26
PRIOR APPLICATION NUMBER: US 60/314,660
PRIOR FILING DATE: 2001-08-27
NUMBER OF SEQ ID NOS: 12

SOFTWARE: PatentIn version 3.1
SEQ ID NO 12 
LENGTH: 1006 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-415-147-12 

Query Match 43.0%; Score 868.5; DB 15; Length 1006; 

Best Local Similarity 47.8%; Pred. No. 2.7e-75;

Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60

II I I =h II Illl-IMIM I I- = I I III Mllllllh: I 
Db 488 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 547

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

= 1= = =11111 I II = = | II =| =| Mlllh lllhlll 

Db 548 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 607

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180 

II = lllh i = 1111111-1 hh I I I I I I Ihlll =1 I 

Db 608 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 667 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

Ml Mllll llllll-l I II =1 M = = I III I =111111 = 

Db 668 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 725

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299 

11= Ihl II =11 = I I llllllhll I I 

Db 726 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKD 785



Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355



Ill I llll = = l :| = l IN- = = ll= I I M 

Db 786 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 839



RESULT 12 
US-10-415-147-2 

Sequence 2, Application US/10415147 
Publication No. US20040043399A1 
GENERAL INFORMATION: 
APPLICANT: Bayer AG 

TITLE OF INVENTION: REGULATION OF HUMAN INOSITOL POLYPHOSPHATE 5-PHOSPHATASE
FILE REFERENCE: LI0235Foreign Countries
CURRENT APPLICATION NUMBER: US/10/415,147
CURRENT FILING DATE: 2003-04-25
PRIOR APPLICATION NUMBER: US 60/243,745
PRIOR FILING DATE: 2000-10-30
PRIOR APPLICATION NUMBER: US 60/257,302
PRIOR FILING DATE: 2000-12-26
PRIOR APPLICATION NUMBER: US 60/314,660
PRIOR FILING DATE: 2001-08-27
NUMBER OF SEQ ID NOS: 12
SOFTWARE: PatentIn version 3.1
SEQ ID NO 2 
LENGTH: 751 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-415-147-2 

Query Match 41.2%; Score 831.5; DB 15; Length 751;

Best Local Similarity 42.9%; Pred. No. 7.2e-72; 

Matches 169; Conservative 56; Mismatches 126; Indels 43; Gaps 6; 

Qy 1 MDVLSPLSFIKVSH----------------------------------VRMQGILLLVFA 26

II I I I hi IMIh:|hll 

Db 195 MDALGPFNFVLVTHPSPPGQPETLLNSWLQLYPGSLWGPLGLCGWVSSVRMQGVILLLFA 254 

Qy 27 KYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLE 86 

II Hh:: = I Mil t I 1 ! I I M I = = I = 1 = : HUM h II 
Db 255 KYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRLAAFGHMLCFLNCHLPAHMDKAEQRKD 314

Qy 87 HFDRILEMQNCEGRDIPNILDHD----LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLW 142

:| M :| :| lllh h IMMIIIl = Mlh =1 = I I 

Db 315 NFQTILSLQQFQGPGAQGILDHEYGLGLVFWFGDLNFRIESYDLHFVKFAIDSDQLHQLW 374 

Qy 143 EKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQ 202

||l||::|| hh llll I I 11 = 111 H llll I I I I II II I I I I h = I 
Db 375 EKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKYDTSAKKRKPAWTDRILWKVK-A 433 

Qy 203 PCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPED 261 

I II I II :: I III I :|IIMh I h Ih I I 

Db 434 PGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVAAQFLLQFAFRDDMPLVRLEVAD 492

Qy 262 LWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVY 321 

I HI = I I I I II I Ihl I I I I hi I I IN 

Db 493 EWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKDYVAYVWAKHEDV--DGNTYQVT 550

Qy 322 IDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Db 551 FSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 584



RESULT 13 
US-10-161-510-10 

Sequence 10, Application US/10161510 
Publication No. US20020192695A1 
GENERAL INFORMATION: 
APPLICANT: EXELIXIS, INC. 

TITLE OF INVENTION: PIBs AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE 
FILE REFERENCE: EX02-074C 

CURRENT APPLICATION NUMBER: US/10/161,510
CURRENT FILING DATE: 2002-06-03
PRIOR APPLICATION NUMBER: US 60/296,076
PRIOR FILING DATE: 2001-06-05
PRIOR APPLICATION NUMBER: US 60/328,605
PRIOR FILING DATE: 2001-10-10
PRIOR APPLICATION NUMBER: US 60/357,253
PRIOR FILING DATE: 2002-02-15
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn version 3.1
SEQ ID NO 10
LENGTH: 1056
TYPE: PRT

ORGANISM: Homo sapiens
US-10-161-510-10

Query Match 41.2%; Score 831.5; DB 13; Length 1056;

Best Local Similarity 42.9%; Pred. No. 1.2e-71;

Matches 169; Conservative 56; Mismatches 126; Indels 43; Gaps 6;

Qy 1 MDVLSPLSFIKVSH----------------------------------VRMQGILLLVFA 26

II I I • I - hi IIMhHIHI 
Db 500 MDALGPFNFVLVTHPSPPGQPETLLNSWLQLYPGSLWGPLGLCGWVSSVRMQGVILLLFA 559 

Qy 27 KYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLE 86

II Hh:: = I I III I I I I E f t t E = = I = h : HIM! I || 
Db 560 KYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRLAAFGHMLCFLNCHLPAHMDKAEQRKD 619

Qy 87 HFDRILEMQNCEGRDIPNILDHD----LIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLW 142

:| || :| I ||||: j : Mlhlllll : Mlh H : M 

Db 620 NFQTILSLQQFQGPGAQGILDHEYGLGLVFWFGDLNFRIESYDLHFVKFAIDSDQLHQLW 679

Qy 143 EKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQ 202

llll|::| hh I I I I I I hill =| INI I I I I I I I I I I I I h h 

Db 680 EKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKYDTSAKKRKPAWTDRILWKVK-A 738

Qy 203 PCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPED 261

I II M II :: I III I Hllllh I h lh I I 

Db 739 PGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVAAQFLLQFAFRDDMPLVRLEVAD 797

Qy 262 LWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVY 321

I Ml : I I I I I I II I : M I II I M I I III 

Db 798 EWVRPEQAVVRYRMETVFARSSWDWIGLYRVGFRHCKDYVAYVWAKHEDV--DGNTYQVT 855

Qy 322 IDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQI 355

Db 856 FSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 889



RESULT 14 

US-10-106-698-4755 

Sequence 4755, Application US/10106698 
Publication No. US20030109690A1 
GENERAL INFORMATION: 
APPLICANT: Ruben et al.

TITLE OF INVENTION: Colon and Colon Cancer Associated Polynucleotides and
Polypeptides

FILE REFERENCE: PA005P1

CURRENT APPLICATION NUMBER: US/10/106,698
CURRENT FILING DATE: 2002-03-27
PRIOR APPLICATION NUMBER: PCT/US00/26524
PRIOR FILING DATE: 2000-09-28
PRIOR APPLICATION NUMBER: US 60/157,137
PRIOR FILING DATE: 1999-09-29
PRIOR APPLICATION NUMBER: US 60/163,280
PRIOR FILING DATE: 1999-11-03
NUMBER OF SEQ ID NOS: 8564
SOFTWARE: PatentIn Ver. 3.0
SEQ ID NO 4755
LENGTH: 279
TYPE: PRT

ORGANISM: Homo sapiens
FEATURE:

NAME/KEY: MISC_FEATURE
LOCATION: (247)

OTHER INFORMATION: Xaa equals any of the naturally occurring L-amino acids
US-10-106-698-4755

Query Match 33.8%; Score 683.5; DB 14; Length 279;

Best Local Similarity 52.8%; Pred. No. 5.1e-58;

Matches 130; Conservative 39; Mismatches 74; Indels 3; Gaps 3;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 

II I I = h II MllhHhlMI llh:: : I I III llllllllh: I 
Db 10 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 69

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120

:h ^ HUM h II -I II =| • I I I I I I I = I I I ! = I I I 

Db 70 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 129

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

II : lllh I : IIIIM-M hh I I I I I I ihlll :| I 

Db 130 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 189 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239

III I I I II I I II I I I - I II I II I III I :|lllll 

Db 190 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVX 247

Qy 240 GTFDLE 245

I h 

Db 248 AQFLLQ 253 



RESULT 15 

US-10-264-049-4157 

; Sequence 4157, Application US/10264049 

; Publication No. US20040005579A1

; GENERAL INFORMATION:

; APPLICANT: Birse et al.

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies
; FILE REFERENCE: PA133P1

; CURRENT APPLICATION NUMBER: US/10/264,049
; CURRENT FILING DATE: 2002-10-04

; PRIOR APPLICATION NUMBER: PCT/US01/18569
; PRIOR FILING DATE: 2001-06-07
; PRIOR APPLICATION NUMBER: US 60/209,467
; PRIOR FILING DATE: 2000-06-07
; NUMBER OF SEQ ID NOS: 4360
; SOFTWARE: PatentIn Ver. 3.1
; SEQ ID NO 4157

LENGTH: 162

TYPE: PRT
; ORGANISM: Homo sapiens

FEATURE:

NAME/KEY: MISC_FEATURE
LOCATION: (120)

; OTHER INFORMATION: Xaa equals any of the twenty naturally occurring L-amino
acids

FEATURE:

NAME/KEY: MISC_FEATURE
LOCATION: (130)

OTHER INFORMATION: Xaa equals any of the twenty naturally occurring L-amino
acids

US-10-264-049-4157 

Query Match 26.8%; Score 541.5; DB 15; Length 162; 

Best Local Similarity 74.6%; Pred. No. 1.7e-44; 

Matches 106; Conservative 0; Mismatches 7; Indels 29; Gaps 2; 
Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60 



Db 26 85

Qy 61 NYQRLKHFDRILEMQNCEGRDIPNIL 106

Db 86 130

Qy 107 DHDLIIWFGDMNFRIEDFGLHF 128

Db 131
Search completed: February 10, 2005, 18:05:21 
Job time : 135 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2005 Compugen Ltd.

OM protein - protein search, using sw model

Run on: February 10, 2005, 17:06:26 ; Search time 41 Seconds
(without alignments)
872.991 Million cell updates/sec

Title: US-09-892-287-1
Perfect score: 2020
Sequence: 1 MDVLSPLSFIKVSHVRMQGI...FQIPPGSLREDPLGEAQPQI 372

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters: 283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries
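The header above names a Smith-Waterman ("sw model") search with a substitution table and separate gap-open and gap-extension penalties. The sketch below illustrates affine-gap local alignment scoring in general; it is not GenCore's implementation. Two things are assumptions made for illustration: `simple_score` is a hypothetical stand-in for the BLOSUM62 table, and the first residue of a gap is charged `gap_open` with each further residue charged `gap_ext` (the report does not say which convention the program uses).

```python
def simple_score(x, y):
    """Hypothetical stand-in for a substitution table such as BLOSUM62."""
    return 5.0 if x == y else -4.0


def sw_affine(a, b, score=simple_score, gap_open=10.0, gap_ext=0.5):
    """Best local alignment score of a and b with affine gap penalties.

    Assumed convention: the first gapped residue costs gap_open and every
    additional residue in the same gap costs gap_ext.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score of an alignment ending at (i, j)
    # E: best score ending at (i, j) with b[j-1] aligned to a gap
    # F: best score ending at (i, j) with a[i-1] aligned to a gap
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_ext)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_ext)
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # Local alignment: a cell never drops below zero.
            H[i][j] = max(0.0, diag, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    # Two runs of matches joined across a two-residue gap.
    print(sw_affine("AAAAACCCCC", "AAAAAGGCCCCC"))
```

With the toy scores, ten matches (50) minus one gap of length two (10.0 + 0.5) gives 39.5, which beats any ungapped alternative for this pair.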



Database: PIR_79:*
pir1:*
pir2:*
pir3:*
pir4:*
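The throughput figure in the run header appears to be consistent with counting one dynamic-programming cell per (query residue, database residue) pair and dividing by the search time. That reading is an assumption inferred from the numbers in this report, not a documented definition; the function name below is illustrative.

```python
def cell_updates_per_sec(query_len, db_residues, seconds):
    """Assumed definition: query length x database residues / search time."""
    return query_len * db_residues / seconds


# Figures taken from the header of this search: 372-residue query,
# 96216763 database residues, 41 seconds.
rate = cell_updates_per_sec(372, 96_216_763, 41)
print(f"{rate / 1e6:.3f} Million cell updates/sec")  # prints 872.991 Million cell updates/sec
```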



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES 



Result No.   Score   % Query Match   Length   DB   ID   Description

ALIGNMENTS



RESULT 1 
T42384 

inositol-1,4,5-trisphosphate 5-phosphatase (EC 3.1.3.56) II, isoform 115K,
membrane-associated - mouse

N;Alternate names: inositol trisphosphate phosphomonoesterase
C;Species: Mus musculus (house mouse)
C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 09-Jul-2004
C;Accession: T42384
R;Matzaris, M.; O'Malley, C.; Badger, A.; Speed, C.; Bird, P.I.; Mitchell, C.A.
submitted to the EMBL Data Library, December 1997
A;Description: Distinct membrane and cytosolic forms of inositol polyphosphate
5-phosphatase II. Efficient membrane localization requires two discrete domains.
A;Reference number: Z22143
A;Accession: T42384
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-1017 <MAT>
A;Cross-references: UNIPROT:O54996; EMBL:AF040094; NID:g2766528; PID:g2766529;
PIDN:AAB95412.1
A;Experimental source: brain
C;Genetics:
A;Gene: INPP5P
C;Keywords: phosphoric monoester hydrolase

Query Match 20.1%; Score 406.5; DB 2; Length 1017;

Best Local Similarity 34.1%; Pred. No. 1.7e-26;

Matches 86; Conservative 42; Mismatches 89; Indels 35; Gaps 4;



Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFG----YWGNKGGVNICLKLYG 64

: II 11 = I I = I i = = I = I II = = = II I Mill :|: 

Db 415 YAKVKFVRLVGIMLLLYVKQEHAAYISEVEAETVGTGFMGRMSFQQGNKGGVAIRFQLHN 474 

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIP NILDHDLIIWFG 115

= ■= = 1111 =1 = = I I I I 11 = 1 = 1 I 

Db 475 TSICVVNSHLAAHTEEYERRNQDYRDICSRMQFPQVDPSQPPLTINKHDVILWLG 529

Qy 116 DMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDR 175 

hhllh = l== :: : : h III I = 111 = 1 lllhl 

Db 530 DLNYRIEELDVGKVKKLVEEKAFQTLYAHDQLKIQVAARTIFDGFTEGEITFQPTYKYDT 589 

Qy 176 NSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDH 235

hhllll I III 11111= I = = = I I I I III 

Db 590 GSDDWDTSESARAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDH 632

Qy 236 KPVSGTFDLELK 247

I I I I Ih = = 
Db 633 KPVSSVFDIGVR 644 
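The two percentages printed with each result can be reproduced from the other figures in this report. The sketch below assumes that Query Match is the score as a percentage of the perfect score (2020 for this query) and that Best Local Similarity is the match count over all aligned columns (matches, conservative, mismatches, and indels). Both definitions are inferred from the numbers shown here rather than taken from program documentation, and the function names are illustrative.

```python
def query_match(score, perfect_score):
    """Assumed definition: score as a percentage of the perfect score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity(matches, conservative, mismatches, indels):
    """Assumed definition: identical positions over all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures from RESULT 1 (T42384) above.
print(query_match(406.5, 2020))               # 20.1
print(best_local_similarity(86, 42, 89, 35))  # 34.1
```

A mismatch between a printed percentage and these formulas is a useful flag for a digit that was misread during scanning.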



RESULT 2 
G59431 

phosphatidylinositol polyphosphate 5-phosphatase, isoform b [imported] - human
C;Species: Homo sapiens (man)
C;Date: 03-Jun-2002 #sequence_revision 03-Jun-2002 #text_change 23-Sep-2002
C;Accession: G59431; H59431; A59443; B59432; C59432; D59432; E59432; F59432
R;Duden, R.; Griffiths, G.; Frank, R.; Argos, P.; Kreis, T.E.
Cell 64, 649-665, 1991
A;Title: Beta-COP, a 110 kd protein associated with non-clathrin-coated vesicles
and the Golgi complex, shows homology to beta-adaptin.
A;Reference number: G59431; MUID:91121521; PMID:1840503
A;Accession: G59431
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <DUD>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Mueller, O.T.; Hartsfield, J.K. Jr.; Gallardo, L.A.; Essig, Y.P.; Miller,
K.L.; Papenhausen, P.R.; Tedesco, T.A.
Am. J. Hum. Genet. 49, 804-810, 1991
A;Title: Lowe oculocerebrorenal syndrome in a female with a balanced X;20
translocation: mapping of the X chromosome breakpoint.
A;Reference number: H59431
A;Accession: H59431
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <MUE>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Attree, O.; Olivos, I.M.; Okabe, I.; Bailey, L.C.; Nelson, D.L.; Lewis, R.A.;
McInnes, R.R.; Nussbaum, R.L.
Nature 358, 239-242, 1992
A;Title: The Lowe's oculocerebrorenal syndrome gene encodes a protein highly
homologous to inositol polyphosphate-5-phosphatase.
A;Reference number: A59443; MUID:92334430; PMID:1321346
A;Accession: A59443
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <ATT>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Leahey, A.M.; Charnas, L.R.; Nussbaum, R.L.
Hum. Mol. Genet. 2, 461-463, 1993
A;Title: Nonsense mutations in the OCRL-1 gene in patients with the
oculocerebrorenal syndrome of Lowe.
A;Reference number: B59432; MUID:93278398; PMID:8504307
A;Accession: B59432
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <LEA>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Zhang, X.; Jefferson, A.B.; Auethavekiat, V.; Majerus, P.W.
Proc. Natl. Acad. Sci. U.S.A. 92, 4853-4856, 1995
A;Title: The protein deficient in Lowe syndrome is a
phosphatidylinositol-4,5-bisphosphate 5-phosphatase.
A;Reference number: C59432
A;Accession: C59432
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <ZHA>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Nussbaum, R.L.; Orrison, B.M.; Janne, P.A.; Charnas, L.; Chinault, A.C.
Hum. Genet. 99, 145-150, 1997
A;Title: Physical mapping and genomic structure of the Lowe syndrome gene OCRL1.
A;Reference number: D59432
A;Accession: D59432
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <NUS>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Janne, P.A.; Suchy, S.F.; Bernard, D.; MacDonald, M.; Crawley, J.; Grinberg,
A.; Wynshaw-Boris, A.; Westphal, H.; Nussbaum, R.L.
J. Clin. Invest. 101, 2042-2053, 1998
A;Title: Functional overlap between murine Inpp5b and Ocrl1 may explain why
deficiency of the murine ortholog for OCRL1 does not cause Lowe syndrome in
mice.
A;Reference number: E59432
A;Accession: E59432
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <JAN>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2
R;Erneux, C.; Govaerts, C.; Communi, D.; Pesesse, X.
Biochim. Biophys. Acta 1436 (1-2), 185-199, 1998
A;Title: The diversity and possible functions of the inositol polyphosphate
5-phosphatases.
A;Reference number: F59432
A;Accession: F59432
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-893 <ERN>
A;Cross-references: GB:NP_001578; PID:g13325070; PIDN:NP_001578.2

Query Match 18.2%; Score 367.5; DB 2; Length 893;

Best Local Similarity 29.4%; Pred. No. 3.2e-23;

Matches 100; Conservative 58; Mismatches 131; Indels 51; Gaps 8;



Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64 

I : II Ih |::|hl|: Ih = = h= Ih I llllll = 

Db 307 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 366

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQN--CEGRDIP--NILDHDLIIWFGDMNFR 120

| = | || |= = :| : = | : : =| Ih h-ll Ihhl 

Db 367 TTFCIVNSHLAAHVEDFERRNQDYKDICARMSFVVPNQTLPQLNIMKHEVVIWLGDLNYR 426

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

: |: | : | = Mhl = =1 II = I MIM == = 

Db 427 LCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRW 486 

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

hi I I Ml MINI I III IIIIIM 

Db 487 DSSGKCRVPAWCDRILWR-----------------GTNVNQLNYRSHMELKTSDHKPVSA 529

Qy 241 TFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPSSPWDWIGLYKV 292 

I : =| : I = =| :||l = 1 I =1 
Db 530 LFHIGVKVVDERRYRKVFEDSVRIMDR MENDFLPSLELSRREF 572

Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I : I : = I hi II I II I

Db 573 VFENVKFRQLQKEKFQISNN-GQVPCHFSFIPKLND 607



RESULT 3 
S29069 

inositol polyphosphate-5-phosphatase homolog - human
C;Species: Homo sapiens (man)
C;Date: 25-Feb-1994 #sequence_revision 10-Nov-1995 #text_change 09-Jul-2004
C;Accession: S29069; I68621
R;Attree, O.; Olivos, I.M.; Okabe, I.; Bailey, L.C.; Nelson, D.L.; Lewis, R.A.;
McInnes, R.R.; Nussbaum, R.L.
Nature 358, 239-242, 1992
A;Title: The Lowe's oculocerebrorenal syndrome gene encodes a protein highly
homologous to inositol polyphosphate-5-phosphatase.
A;Reference number: S29069; MUID:92334430; PMID:1321346
A;Accession: S29069
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-970 <ATT>
A;Cross-references: UNIPROT:Q01968
R;Leahey, A.M.; Charnas, L.R.; Nussbaum, R.L.
Hum. Mol. Genet. 2, 461-463, 1993
A;Title: Nonsense mutations in the OCRL-1 gene in patients with the
oculocerebrorenal syndrome of Lowe.
A;Reference number: I54349; MUID:93278398; PMID:8504307
A;Accession: I68621
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 383-912 <RES>
A;Cross-references: GB:S62085; NID:g385336; PIDN:AAB26926.1; PID:g385337
C;Genetics:
A;Gene: OCRL-1



Query Match 18.2%; Score 367.5; DB 2; Length 970; 

Best Local Similarity 29.4%; Pred. No. 3.6e-23; 

Matches 100; Conservative 58; Mismatches 131; Indels 51; Gaps 8; 

Qy 5 SPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYG 64 

I = II lh |::||:||: lh ■ ■ \ ■ ■ lh I I I 1 I i I = 

Db 384 SKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHN 443

Qy 65 YYVSIINCHLPPHISNNYQRLEHFDRILEMQN--CEGRDIP--NILDHDLIIWFGDMNFR 120

|:| II 1 = : =| = = I : : = I lh l= = = M 11 = 1 = 1 

Db 444 TTFCIVNSHLAAHVEDFERRMQDYKDICARMSFWPNQTLPQLNIMKHEWIWLGDLNYR 503 

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

= |= ! = I = ilhl = =111 = 1 1111 = 1 == = 

Db 504 LCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRW 563

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

1 = 1 I I III MINI : = = I III I 1 I 1 I M 

Db 564 DSSGKCRVPAWCDRILWR GTNVNQLNYRSHMELKTSDHKPVSA 606 

Qy 241 TFDLELKPL VSAPLIVLMPEDLWTVENDMMVSYS-STSDFPSSPWDWIGLYKV 292

I = =1 = I = =1 =111 = 1 I =1 
Db 607 LFHIGVKWDERRYRKVFEDSVRTMDR MENDFLPSLELSRREF 649

Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTED 332

I = I = = I 1 = 1 II Mil 

Db 650 VFENVKFRQLQKGKFQISNN-GQVPCHFSFIPKLND 684



RESULT 4 
A41075 

inositol-1,4,5-trisphosphate 5-phosphatase (EC 3.1.3.56) - human (fragment)
C;Species: Homo sapiens (man)
C;Date: 12-Jun-1992 #sequence_revision 12-Jun-1992 #text_change 29-Aug-1997
C;Accession: A41075
R;Ross, T.S.; Jefferson, A.B.; Mitchell, C.A.; Majerus, P.W.
J. Biol. Chem. 266, 20283-20289, 1991
A;Title: Cloning and expression of human 75-kDa inositol polyphosphate-5-phosphatase.
A;Reference number: A41075; MUID:92041857; PMID:1718960
A;Accession: A41075
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-672 <ROS>
A;Cross-references: GB:M74161
C;Genetics:
A;Gene: GDB:INPP5B
A;Cross-references: GDB:129756; OMIM:147264
A;Map position: 1p34-1p34
C;Keywords: phosphoric monoester hydrolase

Query Match 16.5%; Score 334; DB 2; Length 672; 

Best Local Similarity 31.7%; Pred. No. 1.6e-20; 

Matches 78; Conservative 45; Mismatches 97; Indels 26; Gaps 6; 



Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

: II =1= I I = I I = = I =| II : := Ih I MINI I = = = 
Db 133 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 192 

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

: = l II II =1 : = I II 1 = = 1 I =11 = 1 = 1 11 = 1 = 111 = 

Db 193 WNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 252 

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

: h = 1= = = 1= III I = I II I I lllhl = I 

Db 253 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTRA--LTTGI 310

Qy 185 KKRKP AWTDR I LWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGT 241 

I = l!= I = = = I III IMIMI 

Db 311 PVRSAVLLPGVIGFLWKGK NITQLSYQSHMALKTSDHKPVSSV 353

Qy 242 FDLELK 247 

11= = = 

Db 354 FDIGVR 359



RESULT 5 
JW0105 

synaptojanin 2 alpha protein - mouse
C;Species: Mus musculus (house mouse)
C;Date: 10-Jul-1998 #sequence_revision 10-Jul-1998 #text_change 09-Jul-2004
C;Accession: JW0105
R;Seet, L.F.; Cho, S.; Hessel, A.; Dumont, D.J.
Biochem. Biophys. Res. Commun. 247, 116-122, 1998
A;Title: Molecular cloning of multiple isoforms of synaptojanin 2 and assignment
of the gene to mouse chromosome 17A2-3.1.
A;Reference number: JW0105; MUID:98300294; PMID:9636665
A;Contents: Liver
A;Accession: JW0105
A;Molecule type: mRNA
A;Residues: 1-1216 <SEE>
A;Cross-references: UNIPROT:Q9D2G5; GB:AF041862; NID:g3241994; PIDN:AAC40146.1;
PID:g3241995

Query Match 16.5%; Score 334; DB 2; Length 1216; 

Best Local Similarity 31.2%; Pred. No. 3.7e-20; 



Matches 92; Conservative 49; Mismatches 124; Indels 30; Gaps 9; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

:| :: :: |= | :| : |:|:|: :: : ||: | |||| | | :|: 
Db 536 YILLTSAQLVGVCLYIFVRPYHVPFIRDVAIDTVKTGMGGKAGNKGAVGIRFQLHSTSFC 595 

Qy 69 IINCHLPPHISNNYQRLEHFDRIL-EMQNCEGRDIPNILDHDLIIWFGDMNFRIEDFGLH 127 

= 11 I =11=1== 111111 = 111 1 = 11 I 
Db 596 FVCSHLTAGQSQVKERNEDYREITHKLSFPSGR NIFSHDYVFWCGDFNYRI -DLTYE 651 

Qy 128 FVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKR 187

I =| = = I I III = I : ::| M : | ||||:| I MM I 

Db 652 EVFYFVKRQDWKKLMEFDQLQLQKSSGKIFKDFHEGAVNFGPTYKYDVGSAAYDTSDKCR 711 

Qy 188 KPAWTDRILWRLKRQP CAGP DTPI PPASH-FSLSLRGYSSHMTYGISDHK 236 



Db 712 TPAWTDRVLWWRKKHPYDKTAGELNLLDSDLDGDPQIRHTWSPGTLKYYGRAELQASDHR 771 

Qy 237 PVSGTFDLELKPL VS APL I VLMPEDLWTVENDMMVS YS STSDFP 2 80 

II ::| = : = I h III II = = ll 

Db 772 PVLAI VEVE VQEVDVGARERVFQEVS S VQGPLDATVWNLQSPTLEEKNEFP 823 



RESULT 6 
S68448 

synaptojanin, 170K - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 17-Jul-1998 #sequence_revision 17-Jul-1998 #text_change 05-Nov-1999
C;Accession: S68448; S78547; S78527
R;McPherson, P.S.; Garcia, E.P.; Slepnev, V.I.; David, C.; Zhang, X.; Grabs, D.;
Sossin, W.S.; Bauerfeind, R.; Nemoto, Y.; de Camilli, P.
Nature 379, 353-357, 1996
A;Title: A presynaptic inositol-5-phosphatase.
A;Reference number: A58183; MUID:96149250; PMID:8552192
A;Accession: S68448
A;Status: nucleic acid sequence not shown
A;Molecule type: mRNA
A;Residues: 1-1575 <MCP>
A;Cross-references: EMBL:U45479; NID:g1166574; PIDN:AAB60525.1; PID:g1166575;
PID:g1156576
A;Experimental source: brain
A;Accession: S78547
A;Molecule type: protein
A;Residues: 1075-1093;1173-1199;1290-1308 <MCS>
A;Experimental source: brain
R;de Camilli, P.
submitted to the EMBL Data Library, January 1996
A;Reference number: S78527
A;Accession: S78527
A;Molecule type: mRNA
A;Residues: 1-587,'D',589-1575 <DEC>
A;Cross-references: EMBL:U45479
C;Function:
A;Description: binds the SH3 domain of amphiphysin
C;Keywords: alternative splicing; alternative termination; synaptic vesicle
F;1-1575/Product: synaptojanin, long form b #status predicted <MAT3>
F;1-1139,1156-1575/Product: synaptojanin, long form a #status predicted <MAT2>
F;1-113,1156-1308/Product: synaptojanin, short form #status predicted <MAT1>

Query Match 16.2%; Score 328; DB 2; Length 1575; 

Best Local Similarity 33.6%; Pred. No. 1.7e-19; 



Matches 84; Conservative 44; Mismatches 106; Indels 16; Gaps 6; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

- - = = I : M I = I I I = I : = = = Ih I MM I I = = = 
Db 625 YVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLC 684

Qy 69 IINCHLPPHISNNYQRLEHFDRIL-EMQNCEGRDIPNILDHDLIIWFGDMNFRIEDFGLH 127

: I | :| | | | :: || = || = | || Mil I 

Db 685 FVCSHFAAGQSQVKERNEDFVEIARKLSFPMGR MLFSHDYVFWCGDFNYRI -DLPNE 740 

Qy 128 FVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKR 187 

hi h : : I Ml I : I I || = = I Mlhl I I II It I t I 



Db 741 EVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCR 800



Qy 188 KPAWTDRILWRLKRQPC- -AGPDTPIPPASHFSLSLRGYS SHMTYG ISDHK 236 

IllllhIM :: I : I : || | |= = = || |||: 

Db 801 TPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRAELKTSDHR 860 

Qy 237 PVSGTFDLEL 246

II h = = 

Db 861 PVVALIDIDI 870



RESULT 7 
D86190 

hypothetical protein [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004
C;Accession: D86190
R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.;
Alonso, J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.;
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.;
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.;
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.;
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000
A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.;
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.;
Langin-Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu,
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R.; Marziali, A.; Militscher, J.; Miranda,
M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.;
Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.;
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken,
S.; Vaysberg, M.; Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.;
Venter, J.C.; Davis, R.W.
A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis.
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: D86190
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-1136 <STO>
A;Cross-references: UNIPROT:Q9SYK4; GB:AE005172; NID:g4836913; PIDN:AAD30615.1;
GSPDB:GN00141
C;Genetics:
A;Map position: 1

Query Match 15.3%; Score 309.5; DB 2; Length 1136;

Best Local Similarity 27.3%; Pred. No. 4.3e-18; 

Matches 93; Conservative 52; Mismatches 109; Indels 87; Gaps 11; 

Qy 8 SFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYV 67 

:| :: :: |:|: ::|: :: | : | | ||||| = ===| = 

Db 651 TFERMGSRQLAGLLISLWARKDIRTHVGDLDVAAVPCGFGRAIGNKGGVGLRIRVYDRIM 710

Qy 68 SIINCHLPPHIS NNYQRLEHFDRILEMQNC EGRDIP 103 

:|MI h h II I I : I = II 

Db 711 CFVNCHLAAHLEAVNRRNADFNHIFRLMVFSRGQNLSNAAAAGVSTSAYTTKSNTIPSTG 770



Qy 104 NILDHDLIIWFGDMNFRIEDFGLHF- -VRESIKNRCYGGLWEKDQLSIAKKHDP 155 

= = h= =111 l = h 11= = hi I = 11 = 111 I 

Db 771 AEEIKSDLAAADMVAFFGDFNYRL--FGITYDEARDFISQRSFDWLRERDQLRAEMKVGK 828

Qy 156 LLREFQEGRLLFPPTYKFDRNSN DYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIP 212

= = =1 = MINIM : ||: M I I Ml ll = = = l II 
Db 829 VFQGMREALITFPPTYKFERNRSGLGGYDSGEKKRIPAWCDRVIYR DTQSS 879

Qy 213 PASHFSL S LRG YS SHMT YG I SDHKP VSGTF 242 

i i =i m 1 = 1 mill i 

Db 880 PFSESNLQCPWSSVIMYEACMDVTESDHKPVRCKFHATIAHVDKSVRRQELGKIIRSNE 939 

Qy 243 DLELKP- -LVSAPLIVLMPED- -LWTVENDMMVS 272 

II I II 111=1=1=1=1 
Db 940 KILS I FEDLRFVPETSVSTNNI VLQSQDTVILTITNNSPTS 980 



RESULT 8 
T00670 

probable inositol polyphosphate 5'-phosphatase [imported] - Arabidopsis thaliana
N;Alternate names: hypothetical protein F6E13.3
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 01-Feb-1999 #sequence_revision 01-Feb-1999 #text_change 09-Jul-2004
C;Accession: T00670; H84871
R;Rounsley, S.D.; Kaul, S.; Lin, X.; Ketchum, K.A.; Crosby, M.L.; Brandon, R.C.;
Sykes, S.M.; Mason, T.M.; Kerlavage, A.R.; Adams, M.D.; Somerville, C.R.;
Venter, J.C.
submitted to the EMBL Data Library, June 1998
A;Description: Arabidopsis thaliana chromosome II BAC F6E13 genomic sequence.
A;Reference number: Z14180
A;Accession: T00670
A;Status: translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-1305 <ROU>
A;Cross-references: UNIPROT:O80560; EMBL:AC004005; NID:g3212846; PID:g3212348
A;Experimental source: cultivar Columbia
R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.;
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell,
C.R.; Ketchum, K.A.; Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin,
L.A.; Shen, M.; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams,
M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver,
G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.; Salzberg, S.L.; Fraser,
C.M.; Venter, J.C.
Nature 402, 761-768, 1999
A;Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana.
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: H84871
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-1305 <STO>
A;Cross-references: GB:AE002093; NID:g3212848; PIDN:AAC23399.1; GSPDB:GN00139
C;Genetics:
A;Gene: F6E13.3; At2g43900
A;Map position: 2
A;Introns: 278/1; 362/2; 631/3; 674/2; 702/3; 758/1; 771/3; 861/1; 902/2; 1022/3



Query Match 15.1%; Score 306; DB 2; Length 1305; 

Best Local Similarity 27.1%; Pred. No. 1e-17;

Matches 76; Conservative 55; Mismatches 97; Indels 52; Gaps 7; 



Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

| :: := | = |: == = =: = = | I III Ml = : = = = = 

Db 658 FERMGSRQLAGLLISLWVRKNLRTHVGDIDVAAVPCGFGRAIGNKGGVGLRIRVFDRIMC 717 

Qy 69 IINCHLPPHISNNYQRLEHFDRILE MQNCEGRDIP 103 

IMII h =1 II I : == = I 

Db 718 FINCHLAAHLEAVNRRNADFDHIYKTMSFTRSSNAHNAPAAGVSTGSHTTKSANNANVNT 777 

Qy 104 NILDHDLIIWFGDMNFRIEDFGLHF- -VRESIKNRCYGGLWEKDQLSIAKKHDPL 156 

:: : [--Ml MM ||= = |= = | : | MM | = 

Db 778 EETKQDLAEADMWFFGDFNYRL- -FGISYDEARDFVSQRSFDWLREKDQLRAEMKAGRV 835 

Qy 157 LREFQEGRLLFPPTYKFDRNS NDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPP 213 

: :| • I I I II I M M I h I I I I I I I I h = M II I 

Db 836 FQGMREAIITFPPTYKFERHRPGLGGYDSGEKKRIPAWCDRVIFR DIRTS P 886 

Qy 214 ASHFSL SLRG YS SHMTYG I SDHKPVSGTFDLELK 247 

'III 1= I : I I I I I I I I =: = = 

Db 887 3 S E C S LD C P WAS I ML YD ACMD VTE S DH KP VRC KFHVKI E 926 



RESULT 9 
T19338

hypothetical protein C16C2.3 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004
C;Accession: T19338
R;Barlow, K.
submitted to the EMBL Data Library, October 1996
A;Reference number: Z19109
A;Accession: T19338
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-753 <WIL>
A;Cross-references: UNIPROT:O17590; EMBL:Z81036; PIDN:CAB02743.1; GSPDB:GN00019;
CESP:C16C2.3
A;Experimental source: clone C16C2
C;Genetics:
A;Gene: CESP:C16C2.3
A;Map position: 1
A;Introns: 17/1; 36/2; 88/2; 131/2; 170/3; 268/3; 389/2; 475/3; 498/3; 632/1;
677/2; 713/1

Query Match 14.9%; Score 300.5; DB 2; Length 753; 

Best Local Similarity 31.1%; Pred. No. 1.4e-17; 

Matches 85; Conservative 40; Mismatches 93; Indels 55; Gaps 9; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTG LFGYWGNKGGVNICLKLYGY 65 

I I :|= II ::|l = == I II I Mill = =1 = 

Db 168 FNWGSMRLVGIFVIVFQAWSKWVSDVWKYVATGISVLVNKLGNKGGTAVSMKMNDT 227

Qy 66 YVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPN 104

M =1 I II II M : Ml I

Db 228 WVCFVNAHFA--AGNNE LERRNQDFRDI YNDMVFYPRSQQEGLRDRPLEVP 276

Qy 105 ILDHDLIIWFGDMNFRI--EDFGL--HFVRESIKNRCYGGLWEKDQLSIAKKHDPLL 157

Db 277 IMCLYDHDVVFWFGDLNYRLNTDMYGISNDEVRRIASSDKFADLLQHCQLREQMARGTVF 336

Qy 158 REFQE-GRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH 216

-hi I I lllhl :| hill I llllllll hi I :

Db 337 KDFEEPSTLPFRPTYKYDCGTNTWDTSEKGRVPAWTDRIL-TFKKYPQVGLE 387

Qy 217 FSLSLRGYSSHMTYGISDHKPVSGTFDLELKPL 249

Db 388 SIRPMVSIDTITISDHKPVRAMFNLKVKKI 417




RESULT 10 
D96515 

hypothetical protein F16N3.22 [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004
C;Accession: D96515
R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.;
Alonso, J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.;
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.;
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.;
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.;
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000
A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.;
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.;
Langin-Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu,
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R.; Marziali, A.; Militscher, J.; Miranda,
M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.;
Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.;
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken,
S.; Vaysberg, M.; Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.;
Venter, J.C.; Davis, R.W.
A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis.
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: D96515
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-331 <STO>
A;Cross-references: UNIPROT:Q9SX81; GB:AE005173; NID:g5663810; PIDN:AAD46036.1;
GSPDB:GN00141
C;Genetics:
A;Gene: F16N3.22
A;Map position: 1

Query Match 14.8%; Score 298.5; DB 2; Length 331;

Best Local Similarity 34.4%; Pred. No. 6.8e-18; 

Matches 83; Conservative 33; Mismatches 90; Indels 35; Gaps 9; 
Qy 16 RMQGILLLVFA-KYQHLPYIQILSTKSTPTGLFGYWG-NKGGVNICLKLYGYYVSIINCH 73 



Db 113 KLQSVQLYLFGPKNSHTLVKELKAERYSVGGCGGLIGRKKGAVAIRINYDDIKMVFISCH 172



Qy 74 LPPH I SNNYQRLEHFDR I LEMQNCEGRD I PNI L DHDL I I WFGDMNFR I EDFGLH 127 

II - :| I I I I I II =1 Ihhlhl I 

Db 173 LSAHAKK VDQRNTELRHIANSLLPRDKRKRDLTVWLGDLNYRIQDVSNH 221 

Qy 128 FVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKR 187

II hi I I I I I : : : : M I I I I I I : = MUM | I 

Db 222 PVRSLIQNHLQSVLVSKDQLLQEAERGEIFKGYSEGTLGFKPTYKYNVGSSDYDTSHKIR 281 



Qy 188 KPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSS-HMTYGISDHKPVSGTFDLEL 246 

lllllllh::: || : :| | | || MINI II I 

Db 282 VPAWTDRILFKIQ DT DNIQATLHSYDSIDQVYG-SDHKPVKA- -DLCL 326 

Qy 247 K 247 

I 

Db 327 K 327 



RESULT 11 
G84792 

hypothetical protein At2g37440 [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 09-Jul-2004
C;Accession: G84792
R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.;
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell,
C.R.; Ketchum, K.A.; Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin,
L.A.; Shen, M.; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams,
M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver,
G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.; Salzberg, S.L.; Fraser,
C.M.; Venter, J.C.
Nature 402, 761-768, 1999
A;Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana.
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: G84792
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-401 <STO>
A;Cross-references: UNIPROT:Q9ZUS3; GB:AE002093; NID:g4056496; PIDN:AAC98062.1;
GSPDB:GN00139
C;Genetics:
A;Gene: At2g37440
A;Map position: 2

Query Match 14.7%; Score 297.5; DB 2; Length 401;

Best Local Similarity 29.7%; Pred. No. 1.1e-17;

Matches 76; Conservative 43; Mismatches 98; Indels 39; Gaps 7;

Qy 6 PLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGY 65 

I : : : I M M • : II h II Ml M = h 

Db 113 PRGYSLAASKQMVGIFLCVWVRDDLRKRITNLKVSCVGRGIMGYLGNKGSVSISMSLHET 172 

Qy 66 YVSIINCHLP PHI SNNYQRLEHFDR I LEMQNCEGRD I PNI LDHDL I I WFGDMNF 119 

: : II : I III == : hill Ml M = l = 

Db 173 SLCFVCTHLTSGEKEGDELRRNLDVTEIFKRTRFSRSSKDSRPETIMDHDKVIWLGDLNY 232 



Qy 120 RIE-DFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSN 178 

h II I : = | : I Mill I - : = = = || = : | ||||: ||: 

Db 233 RLRASSDLH EQLRNHDWESLLEKDQLKIEQRAGRIFKGWEEGKIYFAPTYKYRINSD 289 

Qy 179 DY DTS E KKRKPAWTDRILWR LKRQPCAGPDTPIPPASHFSLSLRGYSSHMTY 230

I : Ihl IN Mlh :|: Ml I 

Db 290 NYWQTEKS KEKRRTPAWCDR I LWKGDGMKQ LWYVRGESK 329 



Qy 231 GISDHKPVSGTFDLEL 246 

MIMI | : : 
Db 330 - FSDHRPVQSLFSVHI 344 



RESULT 12 
S61667 

probable membrane protein YOR109w - yeast (Saccharomyces cerevisiae)
N;Alternate names: hypothetical protein O3231; hypothetical protein YOR3231w
C;Species: Saccharomyces cerevisiae
C;Date: 09-Mar-1996 #sequence_revision 12-Apr-1996 #text_change 09-Jul-2004
C;Accession: S61667; S66994
R;Benes, V.; Andrade, M.A.; Rechmann, S.; Teodoru, C.; Banrevi, A.; Sander, C.;
Valencia, A.; Ansorge, W.; Voss, H.
submitted to the EMBL Data Library, December 1995
A;Description: Nucleotide sequence and analysis of a 130 kb fragment of yeast
chromosome XV.
A;Reference number: S61643
A;Accession: S61667
A;Molecule type: DNA
A;Residues: 1-1107 <BEN>
A;Cross-references: UNIPROT:Q12271; EMBL:X94335; NID:g1262139; PID:g1164954
R;Voss, H.; Benes, V.; Rechmann, S.; Teodoru, C.; Schwager, C.; Paces, V.;
Ansorge, W.
submitted to the Protein Sequence Database, July 1996
A;Reference number: S66965
A;Accession: S66994
A;Molecule type: DNA
A;Residues: 1-1107 <VOS>
A;Cross-references: EMBL:Z75017; NID:g1420294; PID:g1420295; MIPS:YOR109w
A;Experimental source: strain S288C
C;Genetics:
A;Gene: SGD:INP53
A;Cross-references: SGD:S0005635; MIPS:YOR109w
A;Map position: 15R
C;Keywords: transmembrane protein
F;60-76/Domain: transmembrane #status predicted <TM1>
F;642-658/Domain: transmembrane #status predicted <TM2>

Query Match 14.4%; Score 291.5; DB 2; Length 1107; 

Best Local Similarity 26.8%; Pred. No. 1.5e-16; 

Matches 98; Conservative 57; Mismatches 141; Indels 69; Gaps 12;

Qy 16 RMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVSIINCHLP 75

M MM II |:: : : || ! |||| Ml : I II 

Db 648 QMTSLLILFFVKADKAKYVKQVEGATKKTGFRGMAGNKGAVSIRFEYGATSFCFVNSHLA 707 



Qy 76 PHISNNYQRLEHFDRILE-MQNCEGRDIPNILDHDLIIWFGDMNFRI EDFGLHFVR 130

Db 708 AGATNVEERRSDYESIVRGITFTRTKMIPH HDSIFWLGDMNYRINLPNEDVR 759



Qy 131 ESIKNRCYG GLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKR 187 

: h I I llh: : hi I I lllhl : Ihllhl 

Db 760 RELLNQEEGYIDKLLHFDQLTLGINSGSVFEGFKEPTLKFRPTYKYDPGTGTYDSSEKER 819 

Qy 188 KPAWTDRILWRLKR-QPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTF 242 

hllllh:: : I : I II Nihil = 

Db 820 TPSWTDRIIYKGENLLPLSYSDAPIM ISDHRPVYAAYRAKI 860 

Qy 243 DLELKPLVSAPLIVLMPEDLWTVENDMM VSYSSTSDFPSSPWDWIGLYKV 292 

|||: ||= == :|:: : || | || 
Db 861 TFVDDKERLSLKKRLFTEYKQEHPEEPGSLISDLLSLDLDNKSTDGFKSSS ES 913 

Qy 293 GLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRP 352 

I |:= - I h I - : = I II = I I =1 

Db 914 SLLDIDPIMAQPTA- -SSVASSSPVSSASASLQPVRT QNSSQSRTPIKKP 961 

Qy 353 FQIPP 357 

II 

Db 962 VLRPP 966 



RESULT 13 
S48433 

inositol-1,4,5-trisphosphate 5-phosphatase homolog YIL002c - yeast
(Saccharomyces cerevisiae)
N;Alternate names: protein YIA2c
C;Species: Saccharomyces cerevisiae
C;Date: 02-Dec-1994 #sequence_revision 02-Dec-1994 #text_change 09-Jul-2004
C;Accession: S48433; S58703; S50873
R;Badcock, K.; Churcher, C.
submitted to the EMBL Data Library, August 1994
A;Reference number: S48432
A;Accession: S48433
A;Molecule type: DNA
A;Residues: 1-946 <BAD>
A;Cross-references: UNIPROT:P40559; GB:Z47047; EMBL:Z38052; NID:g603997;
PID:g763344; MIPS:YIL002c
R;Voss, H.
submitted to the EMBL Data Library, June 1994
A;Reference number: S58703
A;Accession: S58703
A;Molecule type: DNA
A;Residues: 1-946 <VOS>
A;Cross-references: EMBL:X79743
R;Voss, H.; Tamames, J.; Teodoru, C.; Valencia, A.; Sensen, C.; Niemann, S.;
Schwager, C.; Zimmermann, J.; Sander, C.; Ansorge, W.
Yeast 11, 61-78, 1995
A;Title: Nucleotide sequence and analysis of the centromeric region of yeast
chromosome IX.
A;Reference number: S50795; MUID:95282515; PMID:7762303
A;Accession: S50873
A;Status: nucleic acid sequence not shown
A;Molecule type: DNA
A;Residues: 650-820 <VOW>
A;Cross-references: EMBL:X79743
C;Genetics:
A;Gene: SGD:INP51
A;Cross-references: SGD:S0001264; MIPS:YIL002c
A;Map position: 9L
C;Keywords: transmembrane protein
F;71-87/Domain: transmembrane #status predicted <TMM>

Query Match 14.3%; Score 289.5; DB 2; Length 946;

Best Local Similarity 31.5%; Pred. No. 1.7e-16; 

Matches 90; Conservative 38; Mismatches 115; Indels 43; Gaps 8; 



Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

= 1 = = -llllhl • 1 I 1 1 1 1 1 : 1

Db 607 YIRLWSTQLGGILLLLFMNETEYSKVKHIEGDVKKTGFGGMASNKGAVAVSFKYSATRFC 666

Qy 69 IINCHLPPHISNNYQRLEHFDRILE-MQNCEGRDIPNILDHDLIIWFGDMNFRI ED 123

:: M : 1 II : | : :: :| I IN II 1 Mi II

Db 667 VLVSHLAAGLENVEQRHNDYKTIAKSIRFSKGL RIKDHDAIIWMGDFNYRILMSNED 723

Qy 124 FGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

II 1 == 1 hllllh II : 1 1 1 : MINIM : 1

Db 724 VRRKIVSKEYASLFEKDQLNQQMIAGESFPY FHEMAIDFPPTYKFDPGTKNY 775

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

Mill 1 IMIIMI 1 II IMMI

Db 776 DTSEKMRIPAWTDRILSRGE VLEQLEYKCCEDILFSDKRFVYA 818

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDF 279

1 | : | : : : M :::| :: | |

Db 819 IFRARVTWDEQKKTTLGTQIYEKIMERLEGLDDDEKIAVLSDDAF 864





RESULT 14 
F84725

probable inositol polyphosphate 5'-phosphatase [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 09-Jul-2004
C;Accession: F84725
R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.;
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell,
C.R.; Ketchum, K.A.; Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin,
L.A.; Shen, M.; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams,
M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver,
G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.; Salzberg, S.L.; Fraser,
C.M.; Venter, J.C.
Nature 402, 761-768, 1999
A;Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana.
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: F84725
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-1144 <STO>
A;Cross-references: UNIPROT:Q9SKB7; GB:AE002093; NID:g4887753; PIDN:AAD32289.1;
GSPDB:GN00139
C;Genetics:
A;Gene: At2g31830
A;Map position: 2



Query Match 14.1%; Score 284; DB 2; Length 1144; 

Best Local Similarity 27.4%; Pred. No. 6.8e-16; 

Matches 75; Conservative 49; Mismatches 94; Indels 56; Gaps 8; 

Qy 8 SFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYV 67

= 1 == == l = h == = == I : I I MINI "= = = = | = 

Db 660 TFERMGSRQLAGLLISLWVRKSIRTHVGDLDVAAVPCGFGRAIGNKGGVGLRIRVYDRIM 719 

Qy 68 SIINCHLPPHISNNYQRLEHFDRIL-EMQNCEGRDI PN 104 

= I I f t h =| hi I :h = II 

Db 720 CFWCHLAAHLEAVTRRNADFNHIYRSMVFSKGQSVYTAAAAGASTSAQALKNNPNTNNS 779

Qy 105 ILDHDLIIWFGDMNFRIEDFGLHF--VRESIKNRCYGGLWEKDQLSIAKKHDP 155

: Ih :||| hh lh = h I =1 : I Mill 
Db 780 TEEEKSKLASADLVAFFGDFNYRL--FGITYDEARDFISHRSFDWLREKDQLRQEMNEGK 837

Qy 156 LLREFQEGRLLFPPTYKFDRNS NDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIP 212 

: = =| = IMIMI-i Ih Nil Ml lh = :| : 
Db 838 VFQGMREALITFPPTYKFEKNKPGLGGYDSGEKKRIPAWCDRVIYRDNQS 887

Qy 213 PASHFSLSLR GY S SHMT YG I SDHKP V 238 

h Ih 1 = 1 MINI 

Db 888 -ISYTECSLKCPWSSTIMYEACMDVTESDHKPV 920



RESULT 15
JC5765

inositol polyphosphate 5-phosphatase (EC 3.1.3.-) 2 - human
C;Species: Homo sapiens (man)
C;Date: 24-Jan-1998 #sequence_revision 13-Mar-1998 #text_change 09-Jul-2004
C;Accession: JC5765
R;Pesesse, X.; Deleu, S.; De Smedt, F.; Drayer, L.; Erneux, C.
Biochem. Biophys. Res. Commun. 239, 697-700, 1997
A;Title: Identification of a second SH2-domain-containing protein closely
related to the phosphatidylinositol polyphosphate 5-phosphatase SHIP.
A;Reference number: JC5765; MUID:98042455; PMID:9367831
A;Accession: JC5765
A;Status: nucleic acid sequence not shown
A;Molecule type: mRNA
A;Residues: 1-1258 <PES>
A;Cross-references: UNIPROT:O15357; GB:Y14385; NID:g2653423; PIDN:CAA74743.1;
PID:CllG8709; PID:g2653424
C;Comment: This enzyme plays a role in metabolism of inositol phosphate and
phosphatidylinositol.
C;Keywords: phosphoric monoester hydrolase
F;21-117/Domain: SH2 homology <SH2>
F;427-729/Domain: catalytic #status predicted <CAT>
F;927-1171/Region: proline-rich

Query Match 14.1%; Score 284; DB 2; Length 1258; 

Best Local Similarity 25.0%; Pred. No. 7.7e-16;

Matches 96; Conservative 66; Mismatches 154; Indels 68; Gaps 14;

Qy 4 LSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLY 63 

h I : =: : | : | | :| I : I I I I I : I II I I : 



Db 492 LTDLDYRPIAMQSLWNIKVAVLVKPEHENRISHVSTSSVKTGIANTLGNKGAVGVSFMFN 551



Qy 64 GYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDL IIWFGDMNF 119 

I :|||| =| =:= || : = | : | | | : ||||:|: 

Db 552 GTSFGFWCHLTSGNEKTARRNQNYLDILRLLSLGDRQL-NAFDISLRFTHLFWFGDLNY 610 

Qy 120 RIEDFGLHFVRESIKNRCYGGLWEKDQLSIAK-KHDPLLREFQEGRLLFPPTYKFDRNSN 178

hi * = I : : I III- = II II I I = llllh::| I 
Db 611 RL-DMDIQEILNYISRKEFEPLLRVDQLNLEREKHKVFLR-FSEEEISFPPTYRYERGSR 668 

Qy 179 D YDTSEKKRKPAWTDRI LWRLKRQPCAGPDTP I PPASHFSLSLRG YS SHMTYG 231 

I I : hi 11111= I H = I = : 

Db 669 DTYAWHKQKPTGVRTNVPSWCDRILWKSY PETHIICNSYGCTDDIV- - 714 

Qy 232 ISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVE NDMMVSYSSTSDFPSSPWDWIG 288 

III II |||:= : = = I =1 = =| :| = | :| 
Db 715 TSDHSPVFGTFEVGVTSQFISKKGLSKTSDQAYIEFESIEAI VKTASRTKF FIE 768 

Qy 289 LYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISN- - IPT TEDEFLL 336 

I I : I - I I h I = = I -= = I I =1= II 

Db 769 FYSTCLEEYKKSFENDAQSSDNINFLKVQWSSRQLPTLKPILADIEYLQDQHLL 822

Qy 337 CYYSN SLRSWG 34 8 

:|:|::| 

Db 823 LTVKSMDGYESYGECVVALKSMIG 846



Search completed: February 10, 2005, 17:50:42
Job time : 44 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:          February 10, 2005, 17:38:56 ; Search time 176 Seconds
                 (without alignments)
                 1082.349 Million cell updates/sec

Title:           US-09-892-287-1
Perfect score:   2020
Sequence:        1 MDVLSPLSFIKVSHVRMQGI......FQIPPGSLREDPLGEAQPQI 372

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        1612378 seqs, 512079187 residues

Total number of hits satisfying chosen parameters:   1612378

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       UniProt_03:*
                 1: uniprot_sprot:*
                 2: uniprot_trembl:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
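The percentage columns in this report can be checked against the raw counts. The sketch below is a reader's aid, not part of the GenCore output: it assumes that Query Match is the hit score divided by the perfect score, and that Best Local Similarity is the number of identical positions divided by all aligned columns (matches, conservative substitutions, mismatches, and indels). Both assumptions, and the function names, are inferred from the figures printed in this report; they are not taken from GenCore documentation.

```python
PERFECT_SCORE = 2020  # "Perfect score" printed in the search header


def query_match(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the perfect score (assumed definition)."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all aligned columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures reported for the PIR hit S29069 (OCRL homolog):
# Score 367.5; Matches 100; Conservative 58; Mismatches 131; Indels 51.
print(round(query_match(367.5), 1))                       # 18.2
print(round(best_local_similarity(100, 58, 131, 51), 1))  # 29.4
```

Both values agree with the 18.2% Query Match and 29.4% Best Local Similarity printed for that hit, which is consistent with the assumed definitions. The Pred. No. column cannot be reproduced this way, because it depends on the full score distribution of the search.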

SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID            Description

     1    2010   99.5     448   1  SKIP_HUMAN    Q9bt40 homo sapien
     2    1525   75.5     468   1  SKIP_MOUSE    Q8c5l6 mus musculu
     3    1115   55.2     496   2  Q6GN99        Q6gn99 xenopus lae
     4   873.5   43.2    1003   1  P5PA_MOUSE    P59644 mus musculu
     5   872.5   43.2    1001   1  P5PA_RAT      Q9jmc1 rattus norv
     6   870.5   43.1     638   2  Q6ZTH6        Q6zth6 homo sapien
     7   864.5   42.8    1006   1  P5PA_HUMAN    Q15735 homo sapien
     8     516   25.5     501   2  Q7PYY6        Q7pyy6 anopheles g
     9   507.5   25.1     508   2  Q9VXE7        Q9vxe7 drosophila
    10   460.5   22.8     357   2  Q9V7X0        Q9v7x0 drosophila
    11   432.5   21.4     736   2  Q86YE1        Q86ye1 homo sapien
    12   432.5   21.4     748   2  Q6PD53        Q6pd53 homo sapien
    13   432.5   21.4     871   2  Q6P6D4        Q6p6d4 homo sapien
    14   432.5   21.4     942   1  I5P2_HUMAN    P32019 homo sapien
    15   423.5   21.0     742   2  Q8CF65        Q8cf65 mus musculu
    16   423.5   21.0     757   2  Q9DBW2        Q9dbw2 mus musculu
    17   423.5   21.0     776   2  O54996        O54996 mus musculu
    18   423.5   21.0     825   2  Q91ZF8        Q91zf8 mus musculu
    19   423.5   21.0     993   2  Q8K337        Q8k337 mus musculu
    20     384   19.0     787   2  Q8I7P3        Q8i7p3 dictyosteli
    21   373.5   18.5     727   2  Q8BXC9        Q8bxc9 mus musculu
    22   373.5   18.5     900   2  Q6NVF0        Q6nvf0 mus musculu
    23     372   18.4     511   2  Q8BXT3        Q8bxt3 mus musculu
    24   367.5   18.2     901   1  OCRL_HUMAN    Q01968 homo sapien
    25   341.5   16.9    1800   2  Q8I7P5        Q8i7p5 dictyosteli
    26     340   16.8    1113   2  Q9XUD3        Q9xud3 caenorhabdi
    27     340   16.8    1119   2  Q9GT42        Q9gt42 caenorhabdi
    28     339   16.8    1101   2  Q84W55        Q84w55 arabidopsis
    29     339   16.8    1291   2  Q9DGN7        Q9dgn7 lampetra fl
    30     338   16.7    1101   2  Q9XFT1        Q9xft1 arabidopsis
    31     335   16.6    1273   2  Q7Q001        Q7q001 anopheles g
    32     334   16.5    1434   1  SYJ2_MOUSE    Q9d2g5 mus musculu
    33     328   16.2     552   2  Q6AVN0        Q6avn0 oryza sativ
    34     328   16.2    1315   2  O94984        O94984 homo sapien
    35     328   16.2    1574   1  SYJ1_MOUSE    Q8chc4 mus musculu
    36     328   16.2    1574   1  SYJ1_RAT      Q62910 rattus norv
    37     328   16.2    1575   1  SYJ1_HUMAN    O43426 homo sapien
    38   327.5   16.2    1288   2  Q8IZA8        Q8iza8 homo sapien
    39   327.5   16.2    1443   1  SYJ2_HUMAN    O15056 homo sapien
    40   327.5   16.2    1496   2  Q9H226        Q9h226 homo sapien
    41     326   16.1    1248   1  SYJ2_RAT      O55207 rattus norv
    42     326   16.1    1451   2  Q91ZD9        Q91zd9 rattus norv
    43     326   16.1    1496   2  Q91ZD8        Q91zd8 rattus norv
    44     325   16.1    1324   1  SYJ1_BOVIN    O18964 bos taurus
    45   320.5   15.9    1107   2  Q6ZK18        Q6zk18 oryza sativ



ALIGNMENTS 

RESULT 1 
SKIP_HUMAN 

ID SKIP_HUMAN STANDARD; PRT; 448 AA.

AC Q9BT40; Q15733; Q9NPJ5; Q9P2R5; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Skeletal muscle and kidney enriched inositol phosphatase 

DE (EC 3.1.3.56).

GN Name=SKIP; Synonyms=PPS;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), FUNCTION, AND TISSUE

RP SPECIFICITY.

RC TISSUE=Testis;

RX MEDLINE=20219123; PubMed=10753883; DOI=10.1074/jbc.275.15.10870;

RA Ijuin T., Mochizuki Y., Fukami K., Funaki M., Asano T., Takenawa T.;

RT "Identification and characterization of a novel inositol polyphosphate

RT 5-phosphatase.";

RL J. Biol. Chem. 275:10870-10875(2000).

RN [2]

RP SEQUENCE FROM N.A. (ISOFORM 1).

RC TISSUE=Eye;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [3]

RP SEQUENCE OF 120-448 FROM N.A. 

RC TISSUE=Brain; 



RA Nussbaum R.L.; 

RL Submitted (JAN-1996) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SUBCELLULAR LOCATION, AND MUTAGENESIS OF TYR-349; ASP-361; TRP-362 AND

RP TYR-376.

RX MEDLINE=22538472; PubMed=12536145; DOI=10.1074/jbc.M209991200;

RA Gurung R., Tan A., Ooms L.M., McGrath M.J., Huysmans R.D.,

RA Munday A.D., Prescott M., Whisstock J.C., Mitchell C.A.;

RT "Identification of a novel domain in two mammalian inositol-

RT polyphosphate 5-phosphatases that mediates membrane ruffle

RT localization. The inositol 5-phosphatase SKIP localizes to the

RT endoplasmic reticulum and translocates to membrane ruffles following

RT epidermal growth factor stimulation.";

RL J. Biol. Chem. 278:11376-11385(2003).

CC -!- FUNCTION: Inositol 5-phosphatase which acts on inositol 1,4,5-

CC trisphosphate, inositol 1,3,4,5-tetrakisphosphate,

CC phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol

CC 3,4,5-triphosphate. Has 6-fold higher affinity for

CC phosphatidylinositol 4,5-bisphosphate than for inositol 1,4,5-

CC trisphosphate. May negatively regulate assembly of the actin

CC cytoskeleton.

CC -!- CATALYTIC ACTIVITY: D-myo-inositol 1,4,5-trisphosphate + H(2)O =
CC myo-inositol 1,4-bisphosphate + phosphate.

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum. Following stimulation

CC with EGF, translocates to membrane ruffles.

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=1;

CC IsoId=Q9BT40-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q9BT40-2; Sequence=VSP_050612;

CC -!- TISSUE SPECIFICITY: Ubiquitously expressed with highest levels in
CC skeletal muscle, heart and kidney.

CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-
CC phosphatase type II family.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AB036829; BAA92340.1; -.
DR EMBL; AB036830; BAA92341.1; -.
DR EMBL; AB036831; BAA92342.1; -.
DR EMBL; BC004362; AAH04362.1; -.
DR EMBL; U45973; AAB03214.1; -.
DR HSSP; O43001; 1I9Z.
DR MIM; 607875; -.
DR GO; GO:0005829; C:cytosol; ISS.
DR GO; GO:0043005; C:neuronal cell projection; ISS.
DR GO; GO:0016312; F:inositol bisphosphate phosphatase activity; ISS.
DR GO; GO:0046030; F:inositol trisphosphate phosphatase activity; ISS.
DR GO; GO:0042577; F:lipid phosphatase activity; ISS.
DR GO; GO:0030036; P:actin cytoskeleton organization and biogenesis; ISS.
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
KW Alternative splicing; Endoplasmic reticulum; Hydrolase.
FT DOMAIN 16 318 Catalytic (Potential).
FT DOMAIN 321 448 Required for ruffle localization.
FT VARSPLIC 1 76 Missing (in isoform 2).
FT /FTId=VSP_050612.
FT MUTAGEN 349 349 Y->A,F: No effect on EGF-induced ruffle
FT localization.
FT MUTAGEN 361 361 D->A: Significant decrease in EGF-induced
FT ruffle localization.
FT MUTAGEN 362 362 W->A: Significant decrease in EGF-induced
FT ruffle localization.
FT MUTAGEN 376 376 Y->A,F: No effect on EGF-induced ruffle
FT localization.
FT CONFLICT 120 120 T -> A (in Ref. 3).
FT CONFLICT 416 416 R -> S (in Ref. 2).
FT CONFLICT 426 426 R -> S (in Ref. 2).
SQ SEQUENCE 448 AA; 51228 MW; 0651BE3C6E2EEB0F CRC64;




Query Match 99.5%; Score 2010; DB 1; Length 448;
Best Local Similarity 99.5%; Pred. No. 1.1e-164;
Matches 370; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy    1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
Db   77 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 136

Qy   61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
Db  137 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 196

Qy  121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
Db  197 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 256

Qy  181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240
Db  257 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 316

Qy  241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300
Db  317 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 376

Qy  301 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGSL 360
Db  377 VSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYRNSLRSWGIRRPFQIPPGSL 436

Qy  361 REDPLGEAQPQI 372
Db  437 REDPLGEAQPQI 448



RESULT 2 
SKIP_MOUSE

ID SKIP_MOUSE STANDARD; PRT; 468 AA.

AC Q8C5L6; O09040;

DT 10-OCT-2003 (Rel. 42, Created)

DT 29-MAR-2004 (Rel. 43, Last sequence update)

DT 05-JUL-2004 (Rel. 44, Last annotation update)

DE Skeletal muscle and kidney enriched inositol phosphatase

DE (EC 3.1.3.56).

GN Name=Skip; Synonyms=Pps;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=97325785; PubMed=9182797; DOI=10.1016/S0896-6273(00)80312-8;

RA Hamilton B.A., Smith D.J., Mueller K.L., Kerrebrock A.W.,

RA Bronson R.T., van Berkel V., Daly M.J., Kruglyak L., Reeve M.P.,

RA Nemhauser J.L., Hawkins T.L., Rubin E.M., Lander E.S.;

RT "The vibrator mutation causes neurodegeneration via reduced expression 

RT of PITP alpha: positional complementation cloning and extragenic 

RT suppression."; 

RL Neuron 18:711-722(1997). 

RN [2] 

RP SEQUENCE FROM N.A.

RC STRAIN=ILS, and ISS;

RA Ehringer M.A., Thompson J., Conroy O., Xu Y., Yang F., Canniff J.,
RA Beeson M., Gordon L., Bennett B., Johnson T.E., Sikela J.M.;

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Medulla oblongata, and Ovary;

RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

CC -!- FUNCTION: Inositol 5-phosphatase which acts on inositol 1,4,5-

CC trisphosphate, inositol 1,3,4,5-tetrakisphosphate,

CC phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol

CC 3,4,5-triphosphate. Has 6-fold higher affinity for

CC phosphatidylinositol 4,5-bisphosphate than for inositol 1,4,5-

CC trisphosphate. May negatively regulate assembly of the actin

CC cytoskeleton (By similarity).

CC -!- CATALYTIC ACTIVITY: D-myo-inositol 1,4,5-trisphosphate + H(2)O =

CC myo-inositol 1,4-bisphosphate + phosphate.

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum. Following stimulation

CC with EGF, translocates to membrane ruffles (By similarity).

CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-

CC phosphatase type II family.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).
CC

DR EMBL; U96724; AAC53265.1; -.

DR EMBL; U96726; AAC60757.1; -.

DR EMBL; AF483522; AAL90796.1; -.

DR EMBL; AF483523; AAL90797.1; -.

DR EMBL; AK054436; BAC35778.1; -.

DR EMBL; AK078104; BAC37126.1; -.

DR HSSP; O43001; 1I9Z.

DR MGD; MGI:1194899; Pps.

DR GO; GO:0005829; C:cytosol; ISS.

DR GO; GO:0043005; C:neuronal cell projection; ISS.

DR GO; GO:0016312; F:inositol bisphosphate phosphatase activity; ISS.

DR GO; GO:0046030; F:inositol trisphosphate phosphatase activity; ISS.

DR GO; GO:0042577; F:lipid phosphatase activity; ISS.

DR GO; GO:0030036; P:actin cytoskeleton organization and biogenesis; ISS.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR SMART; SM00128; IPPc; 1.

KW Endoplasmic reticulum; Hydrolase.

FT DOMAIN 34 337 Catalytic (Potential).

FT DOMAIN 340 468 Required for ruffle localization.

FT CONFLICT 329 329 D -> E (in Ref. 3; BAC37126).

SQ SEQUENCE 468 AA; 54158 MW; F2E1CA370B97A8A1 CRC64;

Query Match 75.5%; Score 1525; DB 1; Length 468;
Best Local Similarity 75.7%; Pred. No. 7.8e-123;
Matches 283; Conservative 39; Mismatches 50; Indels 2; Gaps 2;

Qy    1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
Db   95 MDMLSPLNFVKISQVRMQGLLLLVFAKYQHLPYIQIISTKSTPTGLYGYWGNKGGVNVCL 154

Qy   61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
Db  155 KLYGYYVSIINCHLPPHMYNNDQRLEHFDRILESLTFEGYDVPNILDHDLILWFGDMNFR 214

Qy  121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
Db  215 IEDFGLLFVQESITRKYYKELWEKDQLFIAKKNDQLLREFQEGPLLFPPTYKFDRHSNNY 274

Qy  181 DTSEKKRKPAWTDRILWRLKRQPC-AGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVS 239
Db  275 DTSEKKRKPAWTDRILWRLKRQPSQASPLASSVPTSYFLLTLKNYVSHMAYSISDHKPVT 334

Qy  240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299
Db  335 GTFDLELNPLMSVPLITMMPEHLWTMENDMLISYTSTPEFLSSSWDWIGLYKVGMRHIND 394

Qy  300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIPPGS 359
Db  395 YVAYVWVGDNQVSYGNNPNQVYINISAIPDTEDQFLLCYYSNNLHSWGISQPFKIPIRS 454

Qy  360 -LREDPLGEAQPQI 372
Db  455 FLREDTLYEPEPQI 468
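
The per-alignment statistics can be checked in the same spirit. The sketch below assumes (the report does not say so) that Best Local Similarity is the number of identical positions divided by the total number of aligned columns, where the columns are the sum of matches, conservative substitutions, mismatches and indels. The function name is illustrative.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all aligned columns (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts copied from the statistics lines of RESULT 1 and RESULT 2.
print(round(best_local_similarity(370, 0, 2, 0), 1))     # SKIP_HUMAN
print(round(best_local_similarity(283, 39, 50, 2), 1))   # SKIP_MOUSE
```

With these inputs the sketch gives 99.5 and 75.7, matching the Best Local Similarity values printed for the two results.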

RESULT 3
Q6GN99

ID Q6GN99 PRELIMINARY; PRT; 496 AA.

AC Q6GN99;

DT 05-JUL-2004 (TrEMBLrel. 27, Created)

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update)

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update)

DE LOC443677 protein (Fragment).

GN Name=LOC443677;

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;

OC Xenopodinae; Xenopus.

OX NCBI_TaxID=8355;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Embryo;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=22341132; PubMed=12454917; DOI=10.1002/dvdy.10174;

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W.,

RA Richardson P.;

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus

RT initiative.";

RL Dev. Dyn. 225:384-391(2002). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RA Klein S., Gerhard D.S.; 

RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC073616; AAH73616.1;

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR000345; CytC_heme_BS.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR SMART; SM00128; IPPc; 1.

DR PROSITE; PS00190; CYTOCHROME_C; UNKNOWN_1.

FT NON_TER 1 1

SQ SEQUENCE 496 AA; 56894 MW; 83230717978C7D63 CRC64;

Query Match 55.2%; Score 1115; DB 2; Length 496;

Best Local Similarity 53.1%; Pred. No. 1.9e-87;

Matches 203; Conservative 65; Mismatches 100; Indels 14; Gaps 3;

Qy    1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
Db  112 MDILAPHGYVKLSSIRLQGLLLLTFVKHQHIPFVQDIRTNYIRTGLFGYWGNKGGVTVRM 171

Qy   61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
Db  172 SVYGHMICFMNCHLPAHMENTNQRLDDFERMLDTQQFDDDNTGNILDHDLVFCFGDLNFR 231

Qy  121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
Db  232 ITDFGIHFIREAINSNKYNLLWEKDQLNIAKKKEQFLQGFLEGPLKFKPTYKFDQNSDVY 291

Qy  181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHF----SLSLRGYSSHMTYGISDHK 236
Db  292 DTSSKKRKPAWTDRILWKLK--DVSGSDPGETTGSDFEEFLKVSLDKYTSHMSYGISDHK 349

Qy  237 PVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRD 296
Db  350 PVTGTFTLQLKPLILTPSVTLKPEGEWNADHDSLISYSVAKDFPSSTWDWIGLYRVGFRH 409

Qy  297 VNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQIP 356
Db  410 TNDYVTYAWVKDDEISFSEDCNQVYMNADDIPLDGGEFLLCYHCHNLQCLAGISQPFQIR 469

Qy  357 PG--------SLREDPLGEAQP 370
Db  470 PGRNLGLKEEENRESPFKTESP 491



RESULT 4 
P5PA_MOUSE 

ID P5PA_MOUSE STANDARD; PRT; 1003 AA. 

AC P59644; 

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 05-JUL-2004 (Rel. 44, Last annotation update)

DE Phosphatidylinositol 4,5-bisphosphate 5-phosphatase A (EC 3.1.3.56).

GN Name=Pib5pa;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=C57BL/6J; TISSUE=Diencephalon;

RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

RN [2] 

RP SUBCELLULAR LOCATION, AND IDENTIFICATION OF DOMAIN REQUIRED FOR 

RP MEMBRANE RUFFLE LOCALIZATION. 



RX MEDLINE=22538472; PubMed=12536145; DOI=10.1074/jbc.M209991200;

RA Gurung R., Tan A., Ooms L.M., McGrath M.J., Huysmans R.D.,

RA Munday A.D., Prescott M., Whisstock J.C., Mitchell C.A.;

RT "Identification of a novel domain in two mammalian inositol-

RT polyphosphate 5-phosphatases that mediates membrane ruffle

RT localization. The inositol 5-phosphatase SKIP localizes to the

RT endoplasmic reticulum and translocates to membrane ruffles following

RT epidermal growth factor stimulation.";

RL J. Biol. Chem. 278:11376-11385(2003).

CC -!- FUNCTION: Inositol 5-phosphatase, which converts inositol 1,4,5-

CC trisphosphate to inositol 1,4-bisphosphate. Also converts

CC phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 4-

CC phosphate and inositol 1,3,4,5-tetrakisphosphate to inositol

CC 1,3,4-trisphosphate in vitro. May be involved in modulation of the

CC function of inositol and phosphatidylinositol polyphosphate-

CC binding proteins that are present at membranes ruffles (By

CC similarity).

CC -!- CATALYTIC ACTIVITY: D-myo-inositol 1,4,5-trisphosphate + H(2)O =
CC myo-inositol 1,4-bisphosphate + phosphate.

CC -!- CATALYTIC ACTIVITY: 1D-myo-inositol 1,3,4,5-tetrakisphosphate +
CC H(2)O = 1D-myo-inositol 1,3,4-trisphosphate + phosphate.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; predominantly localized to
CC membrane ruffles.

CC -!- DOMAIN: The 5 Arg-Ser-Xaa-Ser-Xaa-Xaa (RSXSXX) motifs may
CC constitute binding sites for the 14-3-3 protein.

CC -!- PTM: Phosphorylated at Ser/Thr residues (By similarity).

CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-
CC phosphatase type II family.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AK034272; BAC28654.1; -.
DR HSSP; O43001; 1I9Z.
DR MGD; MGI:2158663; Pib5pa.
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
KW Hydrolase; Phosphorylation; Repeat; SH3-binding.
FT DOMAIN 422 725 Catalytic (Potential).
FT DOMAIN 726 837 Required for ruffle localization
FT DOMAIN 50 374 Pro-rich.
FT DOMAIN 837 934 Ser-rich.
FT SITE 6 11 RSXSXX motif 1.
FT SITE 351 356 RSXSXX motif 2.
FT SITE 871 876 RSXSXX motif 3.
FT SITE 882 887 RSXSXX motif 4.
FT SITE 908 913 RSXSXX motif 5.
FT SITE 346 351 SH3-binding (Potential).
SQ SEQUENCE 1003 AA; 107602 MW; AFF4FD929CFCB6BA CRC64;



Query Match 43.2%; Score 873.5; DB 1; Length 1003;
Best Local Similarity 47.5%; Pred. No. 3.3e-66;
Matches 169; Conservative 56; Mismatches 126; Indels 5; Gaps 4;

Qy    1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
Db  485 MDALGPFNFVLVSTVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 544

Qy   61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
Db  545 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAHGILDHDLVFWFGDLNFR 604

Qy  121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
Db  605 IESYDLHFVKFAIDSNQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 664

Qy  181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239
Db  665 DTSAKKRKPAWTDRILWKVK-APSGGP-SPSGRESHRLQVTQHSYRSHMEYTVSDHKPVA 722

Qy  240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299
Db  723 AQFILQFAFRDDVPLVRLEVADEWARPEQAWRYRVETVFARSSWDWIGLYRVGFRHCKD 782

Qy  300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355
Db  783 YVAYVWAKHESV--DGNIYQVTFSEESLPKGHGDFILGYYSHKHSILIGVTEFFQI 836



RESULT 5 
P5PA_RAT

ID P5PA_RAT STANDARD; PRT; 1001 AA.

AC Q9JMC1;

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 05-JUL-2004 (Rel. 44, Last annotation update)

DE Phosphatidylinositol 4,5-bisphosphate 5-phosphatase A (EC 3.1.3.56)

DE (Proline-rich inositol polyphosphate 5-phosphatase).

GN Name=Pib5pa; Synonyms=Pipp;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A., SUBCELLULAR LOCATION, TISSUE SPECIFICITY, AND

RP ENZYMATIC ACTIVITY.

RC TISSUE=Brain;

RX MEDLINE=20062911; PubMed=10593988; DOI=10.1074/jbc.274.51.36790;

RA Mochizuki Y., Takenawa T.;

RT "Novel inositol polyphosphate 5-phosphatase localizes at membrane

RT ruffles.";

RL J. Biol. Chem. 274:36790-36795(1999).

CC -!- FUNCTION: Inositol 5-phosphatase, which converts inositol 1,4,5-

CC trisphosphate to inositol 1,4-bisphosphate. Also converts

CC phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 4-

CC phosphate and inositol 1,3,4,5-tetrakisphosphate to inositol

CC 1,3,4-trisphosphate in vitro. May be involved in modulation of the

CC function of inositol and phosphatidylinositol polyphosphate-

CC binding proteins that are present at membranes ruffles.

CC -!- CATALYTIC ACTIVITY: D-myo-inositol 1,4,5-trisphosphate + H(2)O =
CC myo-inositol 1,4-bisphosphate + phosphate.

CC -!- CATALYTIC ACTIVITY: 1D-myo-inositol 1,3,4,5-tetrakisphosphate +
CC H(2)O = 1D-myo-inositol 1,3,4-trisphosphate + phosphate.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; predominantly localized to
CC membrane ruffles.

CC -!- TISSUE SPECIFICITY: Expressed in heart, brain, kidney, stomach,
CC small intestine and lung. Not expressed in spleen, thymus,

CC skeletal muscle, testis and skin.

CC -!- DOMAIN: The 5 Arg-Ser-Xaa-Ser-Xaa-Xaa (RSXSXX) motifs may
CC constitute binding sites for the 14-3-3 protein.

CC -!- PTM: Phosphorylated on Ser/Thr residues.

CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-
CC phosphatase type II family.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



CC
DR EMBL; AB032551; BAA90553.1;
DR HSSP; O43001; 1I9Z.
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
KW Hydrolase; Phosphorylation; Repeat; SH3-binding.
FT DOMAIN      420    723    Catalytic (Potential).
FT DOMAIN      724    835    Required for ruffle localization (By
FT                           similarity).
FT DOMAIN       50    387    Pro-rich.
FT DOMAIN      835    932    Ser-rich.
FT SITE          6     11    RSXSXX motif 1.
FT SITE        351    356    RSXSXX motif 2.
FT SITE        869    874    RSXSXX motif 3.
FT SITE        880    885    RSXSXX motif 4.
FT SITE        906    911    RSXSXX motif 5.
FT SITE        346    351    SH3-binding (Potential).
SQ SEQUENCE 1001 AA; 107207 MW; 7BE7741FEF8F3FAB CRC64;



Query Match 43.2%; Score 872.5; DB 1; Length 1001;
Best Local Similarity 47.5%; Pred. No. 4e-66;
Matches 169; Conservative 56; Mismatches 126; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
II I I I M I i I I I = ^ I ! = I I I I 111 = = = I I III MINIMI:: |
Db 483 MDALGPFNFVLVSTVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 542

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
:h : MIMI h II = :| II M M HIMh lllhlll
Db 543 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAHGILDHDLVFWFGDLNFR 602

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
II : Mlh :| = IIIIMhMI I : I = I I I II I I h I I I I I
Db 603 IESYDLHFVKFAIDSNQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 662

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239
III 111111111111-1 I || :| II I Ml I HI Mlh
Db 663 DTSAKKRKPAWTDRILWKVK-APSGGP-SPSGRESHRLQVTQHSYRSHMEYTVSDHKPVA 720

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299
Ih Ih | | | =| I : I I MIIMhll I I
Db 721 ARFLLQFAFRDDVPLVRLEVADEWARPEQAWRYRVETVFARSSWDWIGLYRVGFRHCKD 780

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355
Ihl I =1 h II = = l MM llh -I- MM
Db 781 YVAYVWAKH3EV--DGNIYQVTFSEE3LPKGHGDFILGYYSHHHSILIGVTEPFQI 834



RESULT 6
Q6ZTH6

ID Q6ZTH6 PRELIMINARY; PRT; 638 AA.
AC Q6ZTH6;
DT 05-JUL-2004 (TrEMBLrel. 27, Created)
DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update)
DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update)
DE Hypothetical protein FLJ44647.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Cerebellum;
RA Tanigami A., Fujiwara T., Shibahara T., Goto Y., Hirao M., Shimizu F.,
RA Wakebe H., Ono T., Hishigaki H., Watanabe T., Ozaki K., Sugiyama T.,
RA Irie R., Otsuki T., Sato H., Wakamatsu A., Ishii S., Yamamoto J.,
RA Isono Y., Kawai-Hio Y., Saito K., Nishikawa T., Kimura K.,
RA Yamashita H., Matsuo K., Nakamura Y., Sekine M., Kikuchi H., Kanda K.,
RA Wagatsuma M., Murakawa K., Kanehori K., Takahashi-Fujii A., Oshima A.,
RA Sugiyama A., Kawakami B., Suzuki Y., Sugano S., Nagahari K.,
RA Masuho Y., Nagai K., Isogai T.;
RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AK126610; BAC86611.1; -.
DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
SQ SEQUENCE 638 AA; 70211 MW; EE7EBE8F0960D28C CRC64;



Query Match 43.1%; Score 870.5; DB 2; Length 638;
Best Local Similarity 47.8%; Pred. No. 3.3e-66;
Matches 170; Conservative 54; Mismatches 127; Indels 5; Gaps 4;

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
II I I = I * II I M I M M I M I M III-: I I III IMIMIII:: I
Db 120 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 179



Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
= 1= = MMM h II : = | II =| I I MM h II I hill
Db 180 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHDLVFWFGDLNFR 239

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
II : MM: | : IIMIIMMI MM MM I I Ihlll I I
Db 240 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDAGTNKY 299

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239
Ml MMMMMIMM I II M 1 1 == ' I Ml I MMMM
Db 300 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 357

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299
I M Ihlll Ml - I I 1 1 1 1 1 M = E I I I
Db 358 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 417

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355
MM I I I II = = l MM 111 = = = = lh MM
Db 418 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNKSILIGITEPFQI 471



RESULT 7 
P5PA_HUMAN

ID P5PA_HUMAN STANDARD; PRT; 1006 AA.
AC Q15735; Q8N902; Q9UDT9;
DT 10-OCT-2003 (Rel. 42, Created)
DT 10-OCT-2003 (Rel. 42, Last sequence update)
DT 05-JUL-2004 (Rel. 44, Last annotation update)
DE Phosphatidylinositol 4,5-bisphosphate 5-phosphatase A (EC 3.1.3.56);
GN Name=PIB5PA; Synonyms=PIPP;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A. (ISOFORM 2).
RC TISSUE=Heart;
RX PubMed=14702039; DOI=10.1038/ng1285;

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J.-I., Saito K., Kawai Y., Isono Y., Nakamura Y.,
RA Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M.,
RA Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H.,
RA Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E.,
RA Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M.,
RA Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K.,
RA Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y.,
RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T.,
RA Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O.,
RA Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M.,
RA Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S.,
RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y.,
RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T.,
RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y.,
RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S.,
RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T.,
RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M.,
RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T.,
RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K.,
RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R.,
RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1).
RX MEDLINE=20057165; PubMed=10591208; DOI=10.1038/990031;
RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M.,
RA Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K.,
RA Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P.,
RA Bird C.P., Blakey S.E., Bridgeman A.M., Buck D., Burgess J.,
RA Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G.,
RA Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R.,
RA Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E.,
RA Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G.,
RA Evans K.L., Fey J.M., Fleming K., French L., Garner A.A.,
RA Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C.,
RA Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S.,
RA Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A.,
RA Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M.,
RA Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T.,
RA Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J.,
RA Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T.,
RA Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T.,
RA Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L.,
RA Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M.,
RA Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L.,
RA Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L.,
RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N.,
RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J.,
RA Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S.,
RA Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T.,
RA Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I.,
RA Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H.,
RA Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L.,
RA Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z.,
RA Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P.,
RA Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S.,
RA Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J.,
RA Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T.,
RA Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J.,
RA Korf I., Bedell J.A., Hillier L.W., Mardis E., Waterston R.,
RA Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S.,
RA Budarf M.L., McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E.,
RA Edelmann L., Kim U.J., Shizuya H., Simon M.I., Dumanski J.P.,
RA Peyrard M., Kedra D., Seroussi E., Fransson I., Tapia I., Bruder C.E.,
RA O'Brien K.P., Wilkinson P., Bodenteich A., Hartman K., Hu X.,
RA Khan A.S., Lane L., Tilahun Y., Wright H.;
RT "The DNA sequence of human chromosome 22.";
RL Nature 402:489-495(1999).

RN [3] 

RP SEQUENCE OF 648-1044 FROM N.A. 

RC TISSUE=Brain; 

RA Nussbaum R.L.; 

RL Submitted (JAN-1996) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Inositol 5-phosphatase, which converts inositol 1,4,5-
CC trisphosphate to inositol 1,4-bisphosphate. Also converts
CC phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 4-
CC phosphate and inositol 1,3,4,5-tetrakisphosphate to inositol
CC 1,3,4-trisphosphate in vitro. May be involved in modulation of the
CC function of inositol and phosphatidylinositol polyphosphate-
CC binding proteins that are present at membranes ruffles (By
CC similarity).
CC -!- CATALYTIC ACTIVITY: D-myo-inositol 1,4,5-trisphosphate + H(2)O =
CC myo-inositol 1,4-bisphosphate + phosphate.
CC -!- CATALYTIC ACTIVITY: 1D-myo-inositol 1,3,4,5-tetrakisphosphate +
CC H(2)O = 1D-myo-inositol 1,3,4-trisphosphate + phosphate.
CC -!- SUBCELLULAR LOCATION: Cytoplasmic; predominantly localized to
CC membrane ruffles (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q15735-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q15735-2; Sequence=VSP_007296;
CC -!- DOMAIN: The 5 Arg-Ser-Xaa-Ser-Xaa-Xaa (RSXSXX) motifs may
CC constitute binding sites for the 14-3-3 protein.
CC -!- PTM: Phosphorylated at Ser/Thr residues (By similarity).
CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-
CC phosphatase type II family.
CC -!- CAUTION: Ref. 2 sequence differs from that shown due to erroneous
CC gene model prediction.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AK095944; BAC04657.1; -.
DR EMBL; AC005005; AAD15618.1; ALT_SEQ.
DR EMBL; U45975; AAB03216.1;
DR HSSP; O43001; 1I9Z.
DR Genew; HGNC:8956; PIB5PA.
DR MIM; 606481;
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
KW Alternative splicing; Hydrolase; Phosphorylation; Repeat; SH3-binding.
FT DOMAIN      425    728    Catalytic (Potential).
FT DOMAIN      729    840    Required for ruffle localization (By
FT                           similarity).
FT DOMAIN      122    371    Pro-rich.
FT DOMAIN      840    937    Ser-rich.
FT SITE        102    107    RSXSXX motif 1.
FT SITE        350    355    RSXSXX motif 2.
FT SITE        874    879    RSXSXX motif 3.
FT SITE        885    890    RSXSXX motif 4.
FT SITE        911    916    RSXSXX motif 5.
FT SITE        345    350    SH3-binding (Potential).
FT VARSPLIC      1    367    Missing (in isoform 2).
FT                           /FTId=VSP_007296.
FT CONFLICT    596    596    E -> D (in Ref. 1).
FT CONFLICT    610    612    SYD -> ARG (in Ref. 3).
SQ SEQUENCE 1006 AA; 107210 MW; DCFDD67F464C6E5D CRC64;



Query Match 42.8%; Score 864.5; DB 1; Length 1006; 

Best Local Similarity 47.5%; Pred. No. 2e-65; 

Matches 169; Conservative 55; Mismatches 127; Indels 5; Gaps 4; 

Qy 1 MDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICL 60
I I I I :h II I I I I h : I I -I I I I III- : I I III 111111111- I
Db 488 MDALGPFNFVLVSSVRMQGVILLLFAKYYHLPFLRDVQTDCTRTGLGGYWGNKGGVSVRL 547

Qy 61 KLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFR 120
:h : HUM I II -I I I I : I I I I I : I • lllhlll
Db 548 AAFGHMLCFLNCHLPAHMDKAEQRKDNFQTILSLQQFQGPGAQGILDHELVFWFGDLNFR 607

Qy 121 IEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180
II : Mlh U = IMIMI-II hh I I I I I I INN I I
Db 608 IESYDLHFVKFAIDSDQLHQLWEKDQLNMAKNTWPILKGFQEGPLNFAPTFKFDVGTNKY 667

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASH-FSLSLRGYSSHMTYGISDHKPVS 239
III INI II II III I- I I H || :: I III I = I I I I I I s
Db 668 DTSAKKRKPAWTDRILWKVK-APGGGP-SPSGRKSHRLQVTQHSYRSHMEYTVSDHKPVA 725

Qy 240 GTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVND 299
I I : I I : I II = I I = I I I I I I I I h II I I
Db 726 AQFLLQFAFRDDMPLVRLEVADEWVRPEQAWRYRMETVFARSSWDWIGLYRVGFRHCKD 785

Qy 300 YVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSWGISRPFQI 355
I hi I I I II ::| :|:| Ml- : : | | : | || |
Db 786 YVAYVWAKHEDV--DGNTYQVTFSEESLPKGHGDFILGYYSHNHSILIGITEPFQI 839
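
The summary figures printed with each hit appear to follow from the counts listed beside them. As a minimal sketch in Python (the function names are hypothetical, and both formulas are inferred from the numbers in this listing rather than taken from the search program's documentation), Query Match is treated as the hit score divided by the perfect score of 2020 given in the run header, and Best Local Similarity as identical matches divided by the total number of alignment columns:

```python
def query_match_percent(score: float, perfect_score: float = 2020.0) -> float:
    """Hit score as a percentage of the query's perfect score (assumed formula)."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns (assumed formula)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures reported for RESULT 7 (P5PA_HUMAN) in this listing.
print(query_match_percent(864.5))               # listing reports 42.8%
print(best_local_similarity(169, 55, 127, 5))   # listing reports 47.5%
```

The same two formulas reproduce the percentages listed for RESULT 5 through RESULT 10 in this part of the report, which is the only support offered here for them.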



RESULT 8 
Q7PYY6 

ID Q7PYY6 PRELIMINARY; PRT; 501 AA. 

AC Q7PYY6; 

DT 01-MAR-2004 (TrEMBLrel. 26, Created) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE AgCP9655 (Fragment).

GN Name=agCG49651; ORFNames=ENSANGG00000016677;

OS Anopheles gambiae str. PEST. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles. 

OX NCBI_TaxID=180454;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=PEST;
RA Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data. 

DR EMBL; AAAB01008986; EAA00417.1; -.
DR HSSP; O43001; 1I9Z.
DR InterPro; IPR005135; Exo_endo_phos.
DR Pfam; PF03372; Exo_endo_phos; 1.
FT NON_TER 1 1
SQ SEQUENCE 501 AA; 57809 MW; F8C7DFBBF38FCA79 CRC64;

Query Match 25.5%; Score 516; DB 2; Length 501; 

Best Local Similarity 30.7%; Pred. No. 9e-36; 

Matches 116; Conservative 70; Mismatches 152; Indels 40; Gaps 7; 

Qy 2 DVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLK 61 

Ml := : =111 = 11 lih = t I : = = = h I I I I I MM! hi = 

Db 127 EVLKERDYWIKTEQMQGLLLSVFARRKHLLHLRQVETEYTRTGLGGIWGNKGAVSIRMN 186 

Qy 62 LYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFRI 121

Ml = -I M I = 1 = :MI- I = I 111 = I I M M I I : 

Db 137 \^/GSSICLVNAHLAAHDHMLEERINDYERIVQEQKFHVKAKETIFDHDYVFWFGDLNFRL 24 6 • 

Qy 122 EDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKF 173 

MM I II M I : == = II MMMI 

Db 247 TGEATTSADEIRAMVARDELKQ LIEKDQLLLVRREGRAFAQLQERLPQFPPTFKF 301 

Qy 174 DRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGIS 233

: MMI I I = I I I I I I I h : I II M II 

Db P,02 I3HGSNEYD MKRRP AWTDR I LYAVNENN YRN VRLTAEQTSYKSHPSYSIS 350 

Qy 234 DHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVG 293

!Mih ||:: I M I I =1=1 I I I M M 

Db 351 DHKPVTSEFTLKVYEDTTERTVSFKPIELWLVGEPNTIEYVIPNGFEEGNADWIGIYREN 410 

Qy 294 LRDVNDYVSYAW VGDSKVSCSDNLNQVYIDIS NIPTTEDEFLLCYYS 340 

= : = |::| : : = =: | | = | ::| || : | 

Db 411 FTSLSEYLAYEYTETYKDKQHQLNEQQLQHMQNTRTVQLTFSENVSLPLGTRYQLLYFQS 470 

Qy 341 NSLRSV VGISRPFQI 355 

II MM I 

Db 471 TGTRG VTGL VG I S P AFAV 488 

RESULT 9 
Q9VXE7 

ID Q9VXE7 PRELIMINARY; PRT; 508 AA.
AC Q9VXE7;
DT 01-MAY-2000 (TrEMBLrel. 13, Created)
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)
DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update)
DE CG9784-PA (SD21513p).
GN ORFNames=CG9784;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila.
OX NCBI_TaxID=7227;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=20196006; PubMed=10731132; DOI=10.1126/science.287.5461.2185;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D.,
RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L.,
RA Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D.,
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., Mcintosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., WoodageT, Worley K.C., Wu D., Yang S., Yao Q.A., Ye J.,
RA Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).

RN [2] 

RP SEQUENCE FROM N.A.
RX MEDLINE=22426065; PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: Release 3 of the Drosophila
RT melanogaster euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).

RN [3] 



RP SEQUENCE FROM N.A. 

RX MEDLINE=22426070; PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M.,
RA Ashburner M., Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin:
RT a genomics perspective.";
RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002).

RN [4] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22426069; PubMed=12537572;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,
RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,
RA Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a
RT systematic review.";
RL Genome Biol. 3:RESEARCH0083-RESEARCH0083(2002).

RN [5] 

RP SEQUENCE FROM N.A.
RG FlyBase;
RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP SEQUENCE FROM N.A.
RG FlyBase;
RL Submitted (MAR-2004) to the EMBL/GenBank/DDBJ databases.
RN [7]
RP SEQUENCE FROM N.A.
RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.,
RA Champe M., Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E.,
RA George R., Gonzalez M., Guarin H., Kronmiller B., Li P., Liao G.,
RA Miranda A., Mungall C.J., Nunoo J., Pacleb J., Paragas V., Park S.,
RA Patel S., Phouanenavong S., Wan K., Yu C., Lewis S.E., Rubin G.M.,
RA Celniker S.;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE003502; AAF48627.1;
DR EMBL; AY119249; AAM51109.1; -.
DR HSSP; O43001; 1I9Z.
DR IntAct; Q9VXE7; -.
DR FlyBase; FBgn0030761; CG9784.
DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.
DR InterPro; IPR005135; Exo_endo_phos.
DR InterPro; IPR000300; IPPc.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR SMART; SM00128; IPPc; 1.
SQ SEQUENCE 508 AA; 58973 MW; F7008010B55AD722 CRC64;

Query Match 25.1%; Score 507.5; DB 2; Length 508;
Best Local Similarity 34.3%; Pred. No. 5e-35;

Matches 111; Conservative 62; Mismatches 130; Indels 21; Gaps 7; 

Qy 2 DVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLK 61 

-I =: I = I I I = J t =1 : Ih ::| : : I II I Mill | = = 

Db 106 ELLRNYDYVAVKTEQMQGLLLSMFVRRQHVEHLQDIEAEFTRTGFGGIWGNKGAVSVRFT 165 



Qy 62 LYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFRI 121
I I I - : II I :| = | = =111 : = I III = lllhllh
Db 166 LYGCGLAFVVAHLTAHDHMMDERIEDYKQILENHHYHVKRYREIYDHDYVFWFGDLNFRL 225

Qy 122 E--DFGLHFVRESIKNRC-YGGLWEKDQL-SIAKKHDPLLREFQEGRLLFPPTYKFDRNS 177
: I Ml : = = : | = = ||| = =| = || Nihil =
Db 226 QGSDSSTE-VRELVRDESQHEALIQRDQLYQVREKSQLAFQVLQERLPAFPPTFKFREGT 284

Qy 178 NDYDTSEKKRKPAWTDRILWR LKRQPCAGPDTP I PPASHFSLSLRGYS SHMTYGI SD 234 

::|| Ihllllllh: I II I h I II I III 

Db 285 SEYDL KRRPAWTDRIMYAVQPLNRQ PGMQLS IEQCSYKSHPLYTISD 331 

Qy 235 HKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGL 294

Mil: I = = l I I II H I II = == II = = l MM- 
Db 332 HKPVTSDFTIKLYPNVRAPGVVFSPLSLWKIGDENTVEYHKQAEFDEGSIJDWIGIFPSEY 391 

Qy 295 RDVNDYVSYAWVGDSKVSCSDNLN 318
: Mhl M - I = I
Db 392 ASLADYVAYEYVNQAESPSSSDSN 415

RESULT 10 
Q9V7X0 

ID Q9V7X0 PRELIMINARY; PRT; 357 AA.
AC Q9V7X0;
DT 01-MAY-2000 (TrEMBLrel. 13, Created)
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)
DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update)
DE CG6805-PA (LD06095p).
GN ORFNames=CG6805;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila.
OX NCBI_TaxID=7227;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=20196006; PubMed=10731132; DOI=10.1126/science.287.5461.2185;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D.,
RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L.,
RA Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D.,
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., Mcintosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., WoodageT, Worley K.C., Wu D., Yang S., Yao Q.A., Ye J.,
RA Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";

RL Science 287:2185-2195(2000). 

RN [2]
RP SEQUENCE FROM N.A.
RX MEDLINE=22426065; PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: Release 3 of the Drosophila
RT melanogaster euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22426070; PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M.,
RA Ashburner M., Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin:
RT a genomics perspective.";
RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002).

RN [4] 

RP SEQUENCE FROM N.A.
RX MEDLINE=22426069; PubMed=12537572;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,
RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,
RA Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a
RT systematic review.";
RL Genome Biol. 3:RESEARCH0083-RESEARCH0083(2002).

RN [5] 

RP SEQUENCE FROM N.A. 

RG FlyBase; 



RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.

RN [6] 

RP SEQUENCE FROM N.A. 

RG FlyBase; 

RL Submitted (MAR-2004) to the EMBL/GenBank/DDBJ databases.

RN [7] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.,

RA Champe M., Chavez C., Dorsett V., Farfan D., Frise E., George R.,

RA Gonzalez M., Guarin H., Li P., Liao G., Miranda A., Mungall C.J.,

RA Nunoo J., Pacleb J., Paragas V., Park S., Phouanenavong S., Wan K.,

RA Yu C., Lewis S.E., Rubin G.M., Celniker S.;

RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE003804; AAF57919.1; -. 

DR EMBL; AY069356; AAL39501.1; -. 

DR HSSP; O43001; 1I9Z.

DR IntAct; Q9V7X0; -.

DR FlyBase; FBgn0034179; CG6805.

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR SMART; SM00128; IPPc; 1.

SQ SEQUENCE 357 AA; 40497 MW; 586C3CD3E7F8BA28 CRC64;

Query Match 22.8%; Score 460.5; DB 2; Length 357;

Best Local Similarity 34.0%; Pred. No. 3.5e-31;

Matches 104; Conservative 60; Mismatches 105; Indels 37; Gaps 6; 

Qy 2 DVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLK 61 

Ml hi I = = ||||= =||: = = |:| = = : : | = :| Ml | HIM |:i | 
Db 73 DSLSDHQFVICVD3KQLQGILITMFAQHKHIPHMKEIETEATRTGLGGLWGNKGAVSIRLS 137 

Qy 62 LYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFRI 121 

Ml h = II I =|:| : =|:= : | | | | = || | |: | | | = 

Db 138 LYGTGVAFVCSHLAAHDEKLKERIEDYHQIVDNHKYNAQGYRRIFDHDFVFWFGDLNFRL 197 

Qy 122 E-DFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDY 180

I II ::h I I = III- == M = I Ihll Mil 

Db 198 SGDMSAWDVRTDVENQRYADLLKLDQLNLLREKGNAFSLLEEQQPNFAPTFKFVEGTNDY 257

Qy 181 DTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSG 240

I hill MM h: : I I I : I III I MM MM 

Db 258 NL KRRPAWCDR I LHRVQ SNIYPGITLSANQLSYQSHMDYTLSDHKPVSA 306 

Qy 241 TFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDY 300

Ih =:= M I I I hM I 
Db 307 TFNYKVE AANQTYTDEELHEMTHGSASS - PATP NVS 341

Qy 301 VSYAWV 306 

MMM 

Db 342 LSFAFV 347 



RESULT 11 
Q86YE1 



ID Q86YE1 PRELIMINARY; PRT; 736 AA.

AC Q86YE1;

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-MAR-2004 (TrEMBLrel. 26, Last sequence update)

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update)

DE INPP5B protein (Fragment).

GN Name=INPP5B;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph;

RA Strausberg R.;

RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC042529; AAH42529.2; -. 

DR HSSP; O43001; 1I9Z.

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR InterPro; IPR000198; RhoGAP.

DR InterPro; IPR008936; Rho_GAP.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR Pfam; PF00620; RhoGAP; 1.

DR SMART; SM00128; IPPc; 1.

DR SMART; SM00324; RhoGAP; 1.

DR PROSITE; PS50238; RHOGAP; 1.

FT NON_TER 1 1

SQ SEQUENCE 736 AA; 84286 MW; ACCA05487919D1D4 CRC64;



Query Match 21.4%; Score 432.5; DB 2; Length 736; 

Best Local Similarity 37.0%; Pred. No. 2.4e-28; 



Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4; 



Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

= II :|= I I = I I = = I =| II : == Ih I I I I I I I I = = = 
Db 159 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 218

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

= = l II II =1 = : I I! 1 = =1 I =11=1=1 11 = 1 = 111 = 

Db 219 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 278 

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

: | = = h = : |= III I : I II I I 1111 = 1 hhllll 

Db 279 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 338

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

I I III 11111= I = = = I III M I I ■ 1 I Ih 

Db 339 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 381

Qy 245 ELK 247 

Db 382 GVR 384

RESULT 12
Q6PD53

ID Q6PD53 PRELIMINARY; PRT; 748 AA.

AC Q6PD53;

DT 05-JUL-2004 (TrEMBLrel. 27, Created)

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update)

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update)

DE Inositol polyphosphate-5-phosphatase, 75kDa.

GN Name=INPP5B;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A.

RC TISSUE=Lymph;

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.; 



RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2]

RP SEQUENCE FROM N.A.

RC TISSUE=Lymph; 

RA Director MGC Project; 

RL Submitted (OCT-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC058932; AAH58932.1; -. 

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR InterPro; IPR001220; Lectin_legB.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR SMART; SM00128; IPPc; 1.

DR PROSITE; PS00307; LECTIN_LEGUME_BETA; UNKNOWN_1.

SQ SEQUENCE 748 AA; 85028 MW; 676BD7CD9FE2958E CRC64;

Query Match 21.4%; Score 432.5; DB 2; Length 748; 

Best Local Similarity 37.0%; Pred. No. 2.4e-28; 

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

: II :|= ||:||:: I =1 II = == Ih I llllll I = = = 
Db 336 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 395



Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

-III II =1 = = I II h :| I =11 = 1 = 1 11 = 1-111- 

Db 396 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 455

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184

: | = = |= = = h III I = Mill lllhl hhllll 

Db 456 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 515

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244

I I III MM h I = = = I III -111111 Ih 

Db 516 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 558

Qy 245 ELK 247 

Db 559 GVR 561 



RESULT 13 
Q6P6D4 

ID Q6P6D4 PRELIMINARY; PRT; 871 AA.

AC Q6P6D4;

DT 05-JUL-2004 (TrEMBLrel. 27, Created)

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update)

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update)

DE INPP5B protein.

GN Name=INPP5B;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Skin; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Skin;

RA Director MGC Project;

RL Submitted (NOV-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC062317; AAH62317.1; -.

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR InterPro; IPR001220; Lectin_legB.

DR InterPro; IPR000198; RhoGAP.

DR InterPro; IPR008936; Rho_GAP.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR Pfam; PF00620; RhoGAP; 1.

DR SMART; SM00128; IPPc; 1.

DR SMART; SM00324; RhoGAP; 1.

DR PROSITE; PS00307; LECTIN_LEGUME_BETA; UNKNOWN_1.

DR PROSITE; PS50238; RHOGAP; 1.

SQ SEQUENCE 871 AA; 99120 MW; C9CF9E0430097153 CRC64;

Query Match 21.4%; Score 432.5; DB 2; Length 871; 
Best Local Similarity 37.0%; Pred. No. 3e-28;

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4; 

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 



Db 336 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 395

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

Db 396 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 455

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184

h III I

Db 456 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 515

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

I I III Mllh I I III lllllll Ih 

Db 516 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 558

Qy 245 ELK 247 

Db 559 GVR 561 

RESULT 14 
I5P2_HUMAN 

ID I5P2_HUMAN STANDARD; PRT; 942 AA. 

AC P32019; 

DT 01-JUL-1993 (Rel. 26, Created)

DT 01-FEB-1996 (Rel. 33, Last sequence update)

DT 05-JUL-2004 (Rel. 44, Last annotation update)

DE Type II inositol-1,4,5-trisphosphate 5-phosphatase precursor

DE (EC 3.1.3.36) (Phosphoinositide 5-phosphatase) (5PTase) (Fragment).

GN Name=INPP5B;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95238452; PubMed=7721860; DOI=10.1074/jbc.270.16.9370;

RA Jefferson A.B., Majerus P.W.;

RT "Properties of type II inositol polyphosphate 5-phosphatase.";

RL J. Biol. Chem. 270:9370-9377(1995). 

RN [2] 

RP SEQUENCE OF 233-903 FROM N.A., AND SEQUENCE OF 270-288.

RC TISSUE=Placenta;

RX MEDLINE=92041857; PubMed=1718960;

RA Ross T.S., Jefferson A.B., Mitchell C.A., Majerus P.W.;

RT "Cloning and expression of human 75-kDa inositol polyphosphate-5-

RT phosphatase.";

RL J. Biol. Chem. 266:20283-20289(1991).

CC -!- FUNCTION: Hydrolyzes the calcium-mobilizing second messenger

CC Ins(1,4,5)P3, this is a signal-terminating reaction.

CC -!- CATALYTIC ACTIVITY: 1-phosphatidyl-1D-myo-inositol 4,5-

CC bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 4-phosphate

CC + phosphate.

CC -!- TISSUE SPECIFICITY: Platelets.

CC -!- SIMILARITY: Belongs to the inositol-1,4,5-trisphosphate 5-

CC phosphatase type II family.

CC -!- SIMILARITY: Contains 1 Rho-GAP domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 



DR EMBL; M74161; AAA79207.1; -. 

DR HSSP; O43001; 1I9Z.

DR Genew; HGNC:6077; INPP5B.

DR MIM; 147264; -.

DR GO; GO:0016021; C:integral to membrane; NAS.

DR GO; GO:0004445; F:inositol-polyphosphate 5-phosphatase activity; NAS.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR InterPro; IPR000198; RhoGAP.

DR InterPro; IPR008936; Rho_GAP.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR Pfam; PF00620; RhoGAP; 1.

DR SMART; SM00128; IPPc; 1.

DR SMART; SM00324; RhoGAP; 1.

DR PROSITE; PS50238; RHOGAP; 1.

KW Direct protein sequencing; Hydrolase. 

FT NON_TER 1 1

FT PROPEP <1 269

FT CHAIN 270 942 Type II inositol-1,4,5-trisphosphate 5-

FT phosphatase.

FT DOMAIN 770 942 Rho-GAP.

FT CONFLICT 536 555 GSDDWDTSEKCRAPAWCDRI -> RALTTGIPVRSAVLLPG

FT VIGF (in Ref. 2).

FT CONFLICT 860 860 G -> P (in Ref. 2).

SQ SEQUENCE 942 AA; 106614 MW; 416F9F934E450923 CRC64;

Query Match 21.4%; Score 432.5; DB 1; Length 942;

Best Local Similarity 37.0%; Pred. No. 3.3e-28;

Matches 90; Conservative 45; Mismatches 87; Indels 21; Gaps 4;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68 

: M =|= || = lh: I :| II : == Ih I MINI I = : = 
Db 365 YAKVKLIRLVGIMLLLYVKQEHAAYISEVEAETVGTGIMGRMGNKGGVAIRFQFHNTSIC 424

Qy 69 IINCHLPPHISNNYQRLEHF-DRILEMQNCE-GRDIP--NILDHDLIIWFGDMNFRIEDF 124

::| II II :| : = I II h : I I : I I * I • I 11 = 1-111 = 

Db 425 VVNSHLAAHIEEYERRNQDYKDICSRMQFCQPDPSLPPLTISNHDVILWLGDLNYRIEEL 484

Qy 125 GLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSE 184 

: | : : | : : : I INI : I II I I 1111 = 1 —III 

Db 485 DVEKVKKLIEEKDFQMLYAYDQLKIQVAAKTVFEGFTEGELTFQPTYKYDTGSDDWDTSE 544

Qy 185 KKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDL 244 

I I III llllh I ^ I III Illllll Ih 

Db 545 KCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVSSVFDI 587

Qy 245 ELK 247 

Db 588 GVR 590 



RESULT 15
Q8CF65

ID Q8CF65 PRELIMINARY; PRT; 742 AA.

AC Q8CF65;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Mus musculus adult male lung cDNA, RIKEN full-length enriched library,

DE clone:1200004L23 product:inositol polyphosphate-5-phosphatase, 75 kDa,

DE full insert sequence.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Lung; 

RX MEDLINE=99279253; PubMed=10349636; DOI=10.1016/S0076-6879(99)03004-9;

RA Carninci P., Hayashizaki Y.; 

RT "High-efficiency full-length cDNA cloning."; 

RL Meth. Enzymol. 303:19-44(1999). 

RN [2]

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Lung; 

RX MEDLINE=21085660; PubMed=11217851; DOI=10.1038/35055500;

RA RIKEN FANTOM Consortium; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Lung;

RA The FANTOM Consortium,

RA the RIKEN Genome Exploration Research Group Phase I & II Team;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002). 

RN [4] 

RP SEQUENCE FROM N.A.

RC STRAIN=C57BL/6J; TISSUE=Lung;

RX MEDLINE=20499374; PubMed=11042159; DOI=10.1101/gr.145100;

RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,

RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;

RT "Normalization and subtraction of cap-trapper-selected cDNAs to

RT prepare full-length cDNA libraries for rapid discovery of new genes.";

RL Genome Res. 10:1617-1630(2000). 

RN [5] 

RP SEQUENCE FROM N.A.

RC STRAIN=C57BL/6J; TISSUE=Lung;

RX MEDLINE=20530913; PubMed=11076861; DOI=10.1101/gr.152600;

RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,

RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M.,

RA Sumi N., Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A.,

RA Yamamoto R., Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K.,

RA Fujiwake S., Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M.,

RA Yoneda Y., Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J.,

RA Okazaki Y., Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;

RT "RIKEN integrated sequence analysis (RISA) system-384-format

RT sequencing pipeline with 384 multicapillary sequencer.";

RL Genome Res. 10:1757-1771(2000). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Lung;

RA Adachi J., Aizawa K., Akahira S., Akimura T., Arai A., Aono H.,

RA Arakawa T., Bono H., Carninci P., Fukuda S., Fukunishi Y., Furuno M.,

RA Hanagaki T., Hara A., Hayatsu N., Hiramoto K., Hiraoka T., Hori F.,

RA Imotani K., Ishii Y., Itoh M., Izawa M., Kasukawa T., Kato H.,

RA Kawai J., Kojima Y., Konno H., Kouda M., Koya S., Kurihara C.,

RA Matsuyama T., Miyazaki A., Nishi K., Nomura K., Numazaki R., Ohno M.,

RA Okazaki Y., Okido T., Owa C., Saito H., Saito R., Sakai C., Sakai K.,

RA Sano H., Sasaki D., Shibata K., Shibata Y., Shinagawa A., Shiraki T.,

RA Sogabe Y., Suzuki H., Tagami M., Tagawa A., Takahashi F., Tanaka T.,

RA Tejima Y., Toya T., Yamamura T., Yasunishi A., Yoshida K., Yoshino M.,

RA Muramatsu M., Hayashizaki Y.;

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AK004601; BAC25089.1; -.

DR HSSP; O43001; 1I9Z.

DR GO; GO:0004437; F:inositol or phosphatidylinositol phosphatas...; IEA.

DR InterPro; IPR005135; Exo_endo_phos.

DR InterPro; IPR000300; IPPc.

DR InterPro; IPR000198; RhoGAP.

DR InterPro; IPR008936; Rho_GAP.

DR Pfam; PF03372; Exo_endo_phos; 1.

DR SMART; SM00128; IPPc; 1.

DR SMART; SM00324; RhoGAP; 1.

DR PROSITE; PS50238; RHOGAP; 1.

SQ SEQUENCE 742 AA; 84871 MW; A25751E2A3C40404 CRC64;

Query Match 21.0%; Score 423.5; DB 2; Length 742;

Best Local Similarity 35.1%; Pred. No. 1.4e-27;

Matches 87; Conservative 43; Mismatches 87; Indels 31; Gaps 3;

Qy 9 FIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVS 68

= 11 11= 11 = 11-= I : I II = == 11= I MM!: I =|: : 
Db 165 YAKVKFVRLVGIMLLLYVKQEFAAYISEVEAETVGTGIMGRMGNKGGVAIRFQLHNTSIC 224

Qy 69 IINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIP NILDHDLIIWFGDMNF 119

= = 1111 M : • I | ! I I I = I = j 11 = 1 = 

Db 225 VVNSHLAAHTEEYSRRNQDYRDI CSRMQFPQVDPSQPPLTINKHDVILWLGDLNY 279

Qy 120 RIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSND 179

111= = l = = == = = 1= MM = I I! = I 1111 = 1 1 = 1 

Db 280 RIEELDVGKVKKLVEEKAFQTLYAHDQLKIQVAARTIFDGFTEGEITFQPTYKYDTGSDD 339 

Qy 180 YDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVS 239

= 1111! I III 11111= I = = = I III MilIM 

Db 340 WDTSEKCRAPAWCDRILWKGK-----------------NITQLSYQSHMALKTSDHKPVS 382

Qy 240 GTFDLELK 247

||: ::

Db 383 SVFDIGVR 390

Search completed: February 10, 2005, 17:53:45 
Job time : 130 secs



